Deep reactive policies for planning in stochastic nonlinear domains (2019)
- Authors:
- USP affiliated authors: BARROS, LELIANE NUNES DE - IME ; MAUÁ, DENIS DERATANI - IME ; BUENO, THIAGO PEREIRA - IME
- Unidade: IME
- DOI: 10.1609/aaai.v33i01.33017530
- Assunto: APRENDIZADO COMPUTACIONAL
- Agências de fomento:
- Language: Inglês
- Imprenta:
- Source:
- Título do periódico: Proceedings
- Conference titles: AAAI Conference on Artificial Intelligence
- Este periódico é de assinatura
- Este artigo é de acesso aberto
- URL de acesso aberto
- Cor do Acesso Aberto: bronze
-
ABNT
BUENO, Thiago Pereira et al. Deep reactive policies for planning in stochastic nonlinear domains. 2019, Anais.. Palo Alto: AAAI, 2019. Disponível em: https://doi.org/10.1609/aaai.v33i01.33017530. Acesso em: 24 abr. 2024. -
APA
Bueno, T. P., Barros, L. N. de, Mauá, D. D., & Sanner, S. (2019). Deep reactive policies for planning in stochastic nonlinear domains. In Proceedings. Palo Alto: AAAI. doi:10.1609/aaai.v33i01.33017530 -
NLM
Bueno TP, Barros LN de, Mauá DD, Sanner S. Deep reactive policies for planning in stochastic nonlinear domains [Internet]. Proceedings. 2019 ;[citado 2024 abr. 24 ] Available from: https://doi.org/10.1609/aaai.v33i01.33017530 -
Vancouver
Bueno TP, Barros LN de, Mauá DD, Sanner S. Deep reactive policies for planning in stochastic nonlinear domains [Internet]. Proceedings. 2019 ;[citado 2024 abr. 24 ] Available from: https://doi.org/10.1609/aaai.v33i01.33017530 - On the performance of planning through backpropagation
- Decision-aware model learning for actor-critic methods: when theory does not meet practice
- When a robot reaches out for human help
- Gradient estimation in model-based reinforcement learning: a study on linear quadratic environments
- Planning in stochastic computation graphs: solving stochastic nonlinear problems with backpropagation
- Analyzing the effect of stochastic transitions in policy gradients in deep reinforcement learning
- Exploration versus exploitation in model-based reinforcement learning: an empirical study
- A contact network-based approach for online planning of containment measures for COVID-19
- Markov decision processes specified by probabilistic logic programming: representation and solution
- Modeling Markov decision processes with imprecise probabilities using probabilistic logic programming
Informações sobre o DOI: 10.1609/aaai.v33i01.33017530 (Fonte: oaDOI API)
Download do texto completo
Tipo | Nome | Link | |
---|---|---|---|
2954406.pdf |
How to cite
A citação é gerada automaticamente e pode não estar totalmente de acordo com as normas