MEHTA, PIYUSH, ASHOK KUMAR JHA, M. SATHIS KUMAR, S. RAJESHWARI, HARSHA B K, and M. SANDEEP KUMAR. “DEEP REINFORCEMENT LEARNING FOR DYNAMIC TREATMENT REGIMES”. TPM – Testing, Psychometrics, Methodology in Applied Psychology 32, no. S2(2025) : Posted 09 June (June 9, 2025): 1412–1421. Accessed September 12, 2025. https://tpmap.org/submission/index.php/tpm/article/view/744.