1502.04635
Cited By
v1
v2 (latest)
Parameter estimation in softmax decision-making models with linear objective functions
16 February 2015
Paul B. Reverdy
Naomi Ehrich Leonard
Papers citing
"Parameter estimation in softmax decision-making models with linear objective functions"
5 / 5 papers shown
Parallel bandit architecture based on laser chaos for reinforcement learning
Journal of Physics Communications (J. Phys. Commun.), 2022
Takashi Urushibara
N. Chauvet
Satoshi Kochi
S. Sunada
Kazutaka Kanno
Atsushi Uchida
R. Horisaki
Makoto Naruse
234
1
0
19 May 2022
Reinforcement Learning for Load-balanced Parallel Particle Tracing
Jiayi Xu
Hanqi Guo
Han-Wei Shen
Mukund Raj
Skylar W. Wurster
Tom Peterka
144
8
0
13 Sep 2021
Revisiting the Softmax Bellman Operator: New Benefits and New Perspective
Zhao Song
Ronald E. Parr
Lawrence Carin
244
4
0
02 Dec 2018
Data-Free/Data-Sparse Softmax Parameter Estimation with Structured Class Geometries
Nisar R. Ahmed
198
18
0
03 Jun 2018
On the Properties of the Softmax Function with Application in Game Theory and Reinforcement Learning
Bolin Gao
Lacra Pavel
FAtt
458
371
0
03 Apr 2017