MEHTA, PIYUSH; KUMAR JHA, ASHOK; KUMAR, M. SATHIS; RAJESHWARI, S.; B K, HARSHA; KUMAR, M. SANDEEP. DEEP REINFORCEMENT LEARNING FOR DYNAMIC TREATMENT REGIMES. TPM – Testing, Psychometrics, Methodology in Applied Psychology, [S. l.], v. 32, n. S2 (2025): Posted 09 June, p. 1412–1421, 2025. Available at: https://tpmap.org/submission/index.php/tpm/article/view/744. Accessed: 11 Jan. 2026.