intTypePromotion=1
zunia.vn Tuyển sinh 2024 dành cho Gen-Z zunia.vn zunia.vn
ADSENSE

Reinforcement learning

Xem 1-20 trên 142 kết quả Reinforcement learning
  • Bài viết trình bày một phương pháp giải bài toán lựa chọn PTHL động (Dynamic weapon target assignment -DWTA) sử dụng kỹ thuật học tăng cường sâu đa tác nhân (Multi-Agent Deep Reinforcement Learning), trong đó, các PTHL phòng không đóng vai trò là các tác nhân (agent), được xây dựng và huấn luyện trên bộ thư viện OpenAI Gym, đưa ra quyết định tiêu diệt mục tiêu trên không trong môi trường tác chiến phức tạp, không chắc chắn.

    pdf11p visergeyne 18-06-2024 2 1   Download

  • This paper will be broken into four sections. The introduction discusses the route planning control issues faced by the mobile robot. Section two details the mathematical modeling of an operating system designed for a mobile robot.

    pdf6p viambani 18-06-2024 2 1   Download

  • Trong nghiên cứu này, mục tiêu nhóm tác giả hướng đến là xây dựng phương pháp điều khiển thích nghi thông minh dựa trên thuật toán học tăng cường (Reinforcement learning, RL). Ưu điểm chính của việc phương pháp học tăng cường là khả năng học hỏi từ sự tương tác với môi trường và cung cấp một chiến lược điều khiển tối ưu, cho phép điều khiển hệ thống mà không cần biết trước về mô hình động học của đối tượng.

    pdf7p viambani 18-06-2024 2 2   Download

  • Lecture Artificial intelligence: Q learning. This lecture provides students with content including: supervised learning; unsupervised learning; reinforcement learning; utilize the Q matrix;... Please refer to the detailed content of the lecture!

    pdf18p codabach1016 03-05-2024 1 0   Download

  • Artificial intelligence - Lecture 14: Reinforcement learning. This lecture provides students with content including: reinforcement learning (RL); features of RL; applications of RL; supervised learning vs. reinforcement learning; policy, reward and goal; optimal policies; policy adaptation methods;... Please refer to the detailed content of the lecture!

    pdf6p codabach1016 03-05-2024 4 2   Download

  • This study aims to determine the impact of the ECIRR (Elicit, Confront, Identify, Resolve, Reinforce) learning model on students' mathematical reasoning abilities in terms of student motivation. The research method used was a quasi-experimental method with a post-test only control design research design.

    pdf11p viarnault 25-04-2024 2 1   Download

  • Mastering vocabulary is one of the most critical factors that help learners improve their language proficiency. However, this also puts pressure on learners, especially language learners. This research was carried out to investigate common vocabulary strategies students use for learning the words for the first time and methods for reinforcing the learned ones. In addition, the author tried to identify drawbacks that students deal with in vocabulary acquisition.

    pdf11p vilarry 01-04-2024 1 1   Download

  • Nowadays, many applications uses speech recognition especially the field of computer science and electronics, Speech Recognition (SR) is the interpretation of words spoken into a text. It is also known as Speech-To-Text (STT) or Automatic-Speech-Recognition(ASR), or just Word-Recognition(WR). The HiddenMarkov-Model (HMM) is a type of Markov model, which means that the future state of the model depends on the current state, not on the entire history of the system and the goal of HMM is to learn a sequence of hidden states from a set of known states.

    pdf5p viritesh 02-04-2024 4 1   Download

  • Bài viết "Thiết kế ma trận dịch pha cho RIS trong hệ thống 5G" đề xuất thuật toán học sâu tăng cường DRL để giải quyết mục tiêu tối ưu hóa ma trận dịch pha cho tia phản xạ. Xây dựng các môi trường và thuật toán sử dụng để tối ưu, cũng như thiết kế ma trận dịch pha cho tia phản xạ trong hệ thống 5G được hỗ trợ bởi RIS. Mời các bạn cùng tham khảo!

    pdf7p phocuuvan0201 02-02-2024 1 0   Download

  • The objective is to maximize the task completion rate, considering factors such as user transmit power, task offloading rate, and UAV trajectory variables. To tackle this optimization problem, this paper devises a method based on the Deep Deterministic Policy Gradient (DDPG), an algorithm for continuous action spaces in deep reinforcement learning.

    pdf11p vigojek 02-02-2024 7 2   Download

  • Part 2 book "Artificial intelligence - A modern approach" includes content: Probabilistic reasoning over time; making simple decisions; making complex decisions; learning from examples; knowledge in learning; learning probabilistic models; reinforcement learning; natural language processing; natural language for communication; perception; robotics,... and other contents.

    pdf567p muasambanhan05 16-01-2024 4 0   Download

  • Part 1 book "Zoo animal learning and training" includes content: Learning theory; the cognitive abilities of wild animals; the ultimate benefits of learning; choosing the right method - reinforcement vs punishment; what is there to learn in a zoo setting environmental enrichment - the creation of opportunities for informal learning; the art of "active" training.

    pdf170p muasambanhan02 18-12-2023 2 0   Download

  • Part 1 book "Knowing your horse - A guide to equine learning, training and behaviour" includes content: The principles of good horse training, does classical conditioning ring a bell, living with the consequences, all possible consequences, other laws and factors in learning, the power of positive reinforcement.

    pdf96p oursky06 17-10-2023 2 2   Download

  • Part 2 book "Knowing your horse - A guide to equine learning, training and behaviour" includes content: The sound of learning clicker training, negative reinforcement - reinforcement through escape, understanding punishment, how to deal with unwanted behaviours without using punishment, step by step.

    pdf113p oursky06 17-10-2023 7 2   Download

  • The shear strength of corroded reinforced concrete (CRC) beams is a critical consideration during the design stages of RC structures. In this study, we propose a machine learning technique for estimating the shear strength of CRC beams across a range of service periods.

    pdf12p visharma 20-10-2023 9 4   Download

  • Adaptive Neuro-Based Fuzzy Inference System (ANFIS) and Particle Swarm Optimization (PSO) algorithms were utilized to produce numerical tools for predicting the bond strength between the concrete surface and carbon fiber reinforced polymer (CFRP) sheets. From the relevant literature, a credible database encompassing 242 test specimens was developed, along with six input factors that primarily determine bond strength.

    pdf12p visharma 20-10-2023 9 5   Download

  • Part 2 book "Asking animals - An introduction to animal behaviour testing" includes content: Effects of age and treatment; reinforcement and punishment; learning capacity, memory and cognitive ability; genetic components of behaviour; other test considerations; legislation, guidelines and ethical considerations, future methodologies and technological advances.

    pdf102p oursky06 13-10-2023 6 2   Download

  • Ebook "Machine learning" includes content: Introduction, concept learning and the general to specific ordering; decision tree learning; artificial neural networks; evaluating hypotheses; bayesian learning; computational learning theory; instance based learning; genetic algorithms; learning sets of rules; analytical learning; combining inductive and analytical learning; reinforcement learning.

    pdf421p haojiubujain07 20-09-2023 6 4   Download

  • Ebook "Problem solving with reinforcement learning" includes content: Alternative Q-Learning update rules; connectionist reinforcement learing; the robot proplem, systems with real-valued actions, conclusions.

    pdf113p haojiubujain07 20-09-2023 3 3   Download

  • Ebook "Python machine learning projects" provides readers with contents including: setting up a python programming environment an introduction to machine learning; how to build a machine learning classifier in python with scikit-learn; how to build a neural network to recognize hand written digits with tensorflow; bias-variance for deep reinforcement learning - how to build a bot for atari with openai gym;...

    pdf135p tieulangtran 28-09-2023 10 4   Download

CHỦ ĐỀ BẠN MUỐN TÌM

TOP DOWNLOAD
207 tài liệu
1444 lượt tải
ADSENSE

nocache searchPhinxDoc

 

Đồng bộ tài khoản
2=>2