强化学习: bilingual example sentences


为便于强化学习，他有和我交朋友的强烈意向。

To make intensive study easier, he is very keen to make friends with me.

youdao
学习是一种重要的强化学习算法。

Learning is of great importance in reinforcement learning.

监督学习与强化学习的区别是什么？

What are the differences between supervised learning and reinforcement learning?

讨论平均准则控制马氏链的强化学习算法。

An average-reward reinforcement learning algorithm for controlled Markov chains is discussed.

你能解释一下什么是“强化学习”技术吗？

Can you explain the AI technique called reinforcement learning?

参加为期九天的强化学习，测试和解决问题实验。

They go through nine days of intensive studying, testing, and problem solving.

10年定律是适用的：他此前进行过9年的集中强化学习。

Turns out the rule holds: he'd had nine years of intensive study.

将强化学习技术运用于游戏的几种方法在文献里都有记载。

Several approaches applying reinforcement learning techniques to game playing have been described in the literature.

论文主要研究了基于平均型强化学习算法的动态调度方法。

The thesis mainly studies dynamic scheduling methods based on average-reward reinforcement learning algorithms.

文中基于性能势理论，证明了平均奖赏强化学习的逼近定理。

In this paper, the approximation theorem for average-reward reinforcement learning is proved using the theory of performance potentials.

强化学习是这种情况下的常用技术，而更多的传统情形下需要使用效用函数。

Reinforcement learning is a common technique for this scenario as well as the more traditional scenario of actually learning the utility function.

强化学习的这种特性必然增加智能系统的困难性，学习时间增长。

This characteristic of reinforcement learning inevitably makes learning harder for intelligent systems and lengthens the learning time.

初、中及高级培训班既可满足学员选择，也方便学员系统强化学习。

The series of beginner, intermediate, and advanced training classes not only gives trainees a choice of level but also makes it easier for them to keep improving their legal English systematically.

说明：模拟智能机器小车，通过强化学习算法，学习最优导航策略。

Description: a simulated intelligent robot car learns an optimal navigation policy through a reinforcement learning algorithm.

传统的强化学习算法只能解决离散状态空间和动作空间的学习问题。

Conventional reinforcement learning algorithms can only solve learning problems with discrete state spaces and discrete action spaces.

基于Q强化学习与CMAC神经网络的移动机器人局部路径规划研究。

Research on local path planning of mobile robot based on Q reinforcement learning and CMAC neural networks.

莱昂是在强化学习中心和约翰·马歇尔中学受的教育，二者都在洛杉矶。

Leo was educated at the Center for Enriched Studies and John Marshall High School, both in LA.

提出强化学习中的状态熵，来描述观测状态信息不完整引起的不确定性；

State entropy is proposed for reinforcement learning as a measure of the uncertainty caused by incomplete observed state information.

基于强化学习的多指手控制方法，该方法将反馈控制与强化学习相结合。

The former one is a new approach combining reinforcement learning with feedback control.

其学习是对人类学习的模拟，主要有监督学习、强化学习和无监督学习三种。

The learning of connectionism, which consists mainly of supervised learning, reinforcement learning, and unsupervised learning, is modelled on human learning.

主要研究了强化学习算法及其在机器人足球比赛技术动作学习问题中的应用。

This paper studies reinforcement learning (RL) algorithms and their application to learning technical actions in robot soccer.

在理论分析的基础上，提出了协同博弈的强化学习算法，并证明了算法的收敛性。

On the basis of theoretical analysis, a reinforcement learning algorithm for cooperative games is proposed and its convergence is proved.

Q学习是一种典型的强化学习，其学习效率较低，尤其是当状态空间和决策空间较大时。

Q-learning is a typical Reinforcement Learning (RL) method with a slow convergence speed especially as the scales of the state space and action space increase.

以多水下机器人的编队任务为例探讨了分布式强化学习在多机器人系统中的应用模式。

Taking the formation task of multiple underwater vehicles as an example, the application of distributed reinforcement learning (DRL) in multi-robot systems is discussed.

对于有吸收目标状态的循环任务，比较合理的方法是采用基于平均报酬模型的强化学习。

For cyclical tasks with absorbing goal states, it is reasonable to adopt reinforcement learning based on the average-reward model.

本文提出了基于过程奖赏和优先扫除的强化学习算法作为多机器人系统的冲突消解策略。

This paper presents a reinforcement learning algorithm based on process reward and prioritized sweeping as a conflict-resolution strategy for multi-robot systems.

基于马尔科夫过程的强化学习作为一种在线学习方式，能够很好地应用于单智能体环境中。

Reinforcement learning based on Markov decision processes is an online learning method that works well in single-agent environments.
