A neural reinforcement learning model for tasks with unknown time delays - 代码天地

A neural reinforcement learning model for tasks with unknown time delays

其他 2020-05-23 20:38:11 阅读次数: 0

郑重声明：原文参见标题，如有侵权，请联系作者，将会撤销发布！

Abstract

　　我们提出了一个基于生物学的神经模型，能够在复杂的任务中执行强化学习。该模型的独特之处在于，它能够在一个行动、状态转换和奖励之间存在未知和可变时间延迟的环境中，解决需要智能体执行一系列未经奖励的操作以达到目标的任务。具体来说，这是第一个能够在半马尔可夫决策过程（Semi-Markov Decision Process，SMDP）框架内发挥作用的强化学习神经模型。我们认为，当前建模工作的这种扩展为人类决策的日益复杂的模型奠定了基础。

Keywords: 强化学习；神经模型；SMDP

1. Introduction

2. Background

3. Methods

3.1 Model architecture

3.2 Representing and computing with neural activities

3.3 Learning

3.4 Error calculation

4. Results

5. Discussion

猜你喜欢

转载自www.cnblogs.com/lucifer1997/p/12944231.html

A neural reinforcement learning model for tasks with unknown time delays

论文笔记12:Building Adaptive Tutoring Model using Artificial Neural Networks and Reinforcement Learning

Deep Reinforcement Learning is a waste of time

[Reinforcement Learning] Model-Free Prediction

Time Delays and deferred work

论文笔记系列-Neural Architecture Search With Reinforcement Learning

2017-ICLR-Neural Architecture Search with Reinforcement Learning 论文阅读

NAS：NEURAL ARCHITECTURE SEARCH WITH REINFORCEMENT LEARNING NAS开山之作

Neural Network Dynamics for Model-Based Deep Reinforcement Learniing with Model-Free Fine-Tuning

Reinforcement Learning强化学习系列之一：model-based learning

Linux Kernel Programming - Time,Delays,and Deferred Work

LDD-Time, Delays, and Deferred Work

CAPES:Unsupervised Storage Performance Tuning Using Neural Network-Based Deep Reinforcement Learning

网络结构搜索（1）—— NAS（Neural architecture search with reinforcement learning）论文笔记

《Graph Representation Learning》【5】——The Graph Neural Network Model

Reinforcement Learning(001)

reinforcement-learning-1

Introduction to Reinforcement Learning

Reinforcement Learning——MDP

Tutorials on Inverse Reinforcement Learning

A Distributional Perspective on Reinforcement Learning

Reinforcement Learning 增强学习

Robust Adversarial Reinforcement Learning

Control of a Quadrotor with Reinforcement Learning

Reinforcement Learning NOTE

Policy in Reinforcement Learning

Reinforcement Learning Cheatsheet

【ML】Reinforcement Learning

Reinforcement Learning 笔记（1）

Reinforcement Learning 笔记（4）

今日推荐

开源日报 | Chrome内置Gemini的意义不在于Gemini；中国AI追随之路的五大误区；ECharts创始人“下海”养鱼；谷歌I/O开发者大会什么都有，只是没有惊喜

微软回应中国区AI团队“打包赴美”传闻

基于大语言模型的开源知识库问答系统 MaxKB GitHub Star 数量突破 5,000 个！

美国拟限制 AI 大模型出口中国和俄罗斯

苹果将与 OpenAI 达成协议，将 ChatGPT 应用于 iPhone

openKylin 社区生态委员会第六次会议圆满召开

阿里云正式发布通义千问 2.5

Python 3.13 发布首个 Beta：实验性自由线程模式和 JIT、改进交互式解释器

Stack Overflow 拿我的代码去训练 AI 大模型，还封了我的账号

Pop!_OS 的 COSMIC 桌面完成 App Store 上架工作

《2024 年一季度互联网投融资运行情况》研究报告

报告：Django 仍然是 74% 开发者的首选

周排行

返回指定时间格式

fopen函数中的mode参数

Java 单例模式探讨

Flex remoteobject工作原理探讨

寻找mplayer的便捷安装方法

30天了解30种技术系列---(26)MySQL自动化运维工具Inception

关于Jboss/Tomcat/Jetty的JNDI定义123

程序减肥，strip，eu-strip 及其符号表

AsyncTask、View.post(Runnable)、ViewTreeObserver三种方式总结frame animation自动启动

Json和Bean的互相转换

每日归档

更多

2024-05-15(24)

2024-05-14(0)

2024-05-13(18)

2024-05-12(0)

2024-05-11(38)

2024-05-10(38)

2024-05-09(35)

2024-05-08(42)

2024-05-07(14)

2024-05-06(40)