为便于强化学习,他有和我交朋友的强烈意向。
For convenience of intensive study, he has an intense intention of making friend with me.
学习是一种重要的强化学习算法。
监督学习与强化学习的区别是什么?
What are the differences between supervised learning and reinforcement learning?
讨论平均准则控制马氏链的强化学习算法。
An average reward reinforcement learning algorithm for control Markov chains is presented.
你能解释一下什么是“强化学习”技术吗?
Can you explain the A. I. technique called reinforcement learning?
参加为期九天的强化学习,测试和解决问题实验。
They go through nine days of a tensui studying, testing, and problem solving.
10年定律是适用的:他此前进行过9年的集中强化学习。
Turns out the rule holds: he'd had nine years of intensive study.
将强化学习技术运用于游戏的集中方法在文献里都有记载。
Several approaches applying reinforcement learning techniques to game playing have been described in the literature.
论文主要研究了基于平均型强化学习算法的动态调度方法。
The thesis mainly focuses on the dynamic scheduling method based on the averaged rewards reinforcement learning algorithms.
文中基于性能势理论,证明了平均奖赏强化学习的逼近定理。
In this paper, the approximate theorem of average reward reinforcement learning is proven by means of the theory of performance potentials.
强化学习是这种情况下的常用技术,而更多的传统情形下需要使用效用函数。
Reinforcement learning is a common technique for this scenario as well as the more traditional scenario of actually learning the utility function.
强化学习的这种特性必然增加智能系统的困难性,学习时间增长。
This characteristic of reinforcement learning must increase learning difficulty for intelligent system and learning time also grows up.
初、中及高级培训班既可满足学员选择,也方便学员系统强化学习。
The series training programs of junior, middle and senior classes can not only satisfy trainee's selection accordingly, but also facilitate their continued upgrading progress for legal English.
说明:模拟智能机器小车,通过强化学习算法,学习最优导航策略。
Simulation machine car through reinforcement learning algorithm, learning optimal navigation strategies.
传统的强化学习算法只能解决离散状态空间和动作空间的学习问题。
Conventional reinforcement algorithms only deal with discrete state Spaces and discrete action Spaces.
基于Q强化学习与CMAC神经网络的移动机器人局部路径规划研究。
Research on local path planning of mobile robot based on Q reinforcement learning and CMAC neural networks.
莱昂是在强化学习中心和约翰·马歇尔中学受的教育,二者都在洛杉矶。
Leo was educated at the Center for Enriched Studies and John Marshall High School, both in LA.
提出强化学习中的状态熵,来描述观测状态信息不完整引起的不确定性;
The state entropy is proposed for reinforcement learning as the measurement of the state's uncertainty caused by the incompletion of the observed information.
基于强化学习的多指手控制方法,该方法将反馈控制与强化学习相结合。
The former one is a new approach combining reinforcement learning with feedback control.
其学习是对人类学习的模拟,主要有监督学习、强化学习和无监督学习三种。
The learning of connectionism, which consists mainly of supervised learning, intensive learning and unsupervised learning, is modelled after the learning of human beings.
主要研究了强化学习算法及其在机器人足球比赛技术动作学习问题中的应用。
This paper discusses reinforcement learning(RL)algorithm and its application to technical action learning of soccer robot.
在理论分析的基础上,提出了协同博弈的强化学习算法,并证明了算法的收敛性。
On the basis of theoretical analysis, the cooperative game reinforcement learning method is proposed and its convergence is proved.
学习是一种典型的强化学习,其学习效率较低,尤其是当状态空间和决策空间较大时。
Q-learning is a typical Reinforcement Learning (RL) method with a slow convergence speed especially as the scales of the state space and action space increase.
以多水下机器人的编队任务为例探讨了分布式强化学习在多机器人系统中的应用模式。
Taking formation problem of multiple underwater vehicles as an example, the application model of DRL in multi-robot systems is illustrated.
对于有吸收目标状态的循环任务,比较合理的方法是采用基于平均报酬模型的强化学习。
It is rational to adopt the average reward reinforcement learning algorithms for solving the absorbing goal states cyclical tasks.
本文提出了基于过程奖赏和优先扫除的强化学习算法作为多机器人系统的冲突消解策略。
A reinforcement learning algorithm based on process reward and prioritized sweeping is presented as interference solving strategy.
基于马尔科夫过程的强化学习作为一种在线学习方式,能够很好地应用于单智能体环境中。
Reinforcement learning based on Markov decision process is a way of on-line learning, which can be applied to single agent environment.
基于马尔科夫过程的强化学习作为一种在线学习方式,能够很好地应用于单智能体环境中。
Reinforcement learning based on Markov decision process is a way of on-line learning, which can be applied to single agent environment.
应用推荐