Q Learning Algorithm - Search News

A Weighted Smooth Q-Learning Algorithm

Abstract: Q-learning and double Q-learning are well-known sample-based, off-policy reinforcement learning algorithms. However, Q-learning suffers from overestimation bias, while double Q-learning ...

Frontiers

Using reinforcement learning in genome assembly: in-depth analysis of a Q-learning assembler

Genome assembly remains an unsolved problem, and de novo strategies (i.e., those run without a reference) are relevant but computationally complex tasks in genomics. Although de novo assemblers have ...

Semiconductor Engineering

SpiNNaker2 Neuromorphic Platform: HW-Aware Fine-Tuning of Spiking Q-Networks (TU Dresden Et ...

A new technical paper titled “Hardware-Aware Fine-Tuning of Spiking Q-Networks on the SpiNNaker2 Neuromorphic Platform” was published by researchers at TU Dresden, ScaDS.AI and Centre for Tactile ...

Frontiers

A novel reinforcement learning framework-based path planning algorithm for unmanned surface ...

Unmanned surface vehicles (USVs) are now widely used in ocean observation missions, helping researchers to monitor climate change, collect environmental data, and observe marine ecosystem ...

Scientific Research Publishing

Multi-Agent Strategic Confrontation Game via Alternating Markov Decision Process Based ...

To provide quantitative analysis of strategic confrontation games such as cross-border trades like tariff disputes and competitive scenarios like auction bidding, we propose an alternating Markov ...

IEEE

Adaptive Q-Learning Algorithm for Optimal Load Frequency Control in Single-Area Power ...

Abstract: This paper presents a novel Q-learning algorithm to address the optimal load frequency control (LFC) problem in a single-area power system with unknown parameters. LFC is a critical issue ...

Microsoft

Stochastic Approximation and Reinforcement Learning: Hidden Theory and New Super-Fast ...

Stochastic approximation algorithms are used to approximate solutions to fixed point equations that involve expectations of functions with respect to possibly unknown distributions. Among many ...
