2025
    
    
  
  
      “ On Generalization Within Multi-Objective Reinforcement Learning Algorithms” 
        by
       Jayden Teoh,  Pradeep Varakantham and Peter Vamplew. ICLR, 2025. 
    
    
  
  
      “ On Minimizing Adversarial Counterfactual Error in Adversarial Reinforcement Learning” 
        by
       Roman Belaire, Arunesh Sinha and  Pradeep Varakantham. ICLR, 2025. 
    
    
  
  
      “ On Semantic Loss-Guided Data-Efficient Supervised Fine-Tuning for Safe Responses in LLMs” 
        by
       Yuxiao Lu, Arunesh Sinha and  Pradeep Varakantham. ICLR, 2025. 
    
    
  
  
      “ Bootstrapping Language Models with DPO Implicit Rewards” 
        by
       Changyu Chen, Zichen Liu, Chao Du, Tianyu Pang, Qian Liu, Arunesh Sinha,  Pradeep Varakantham and Min Lin. ICLR, 2025. 
    
    
  
  
      “ Unlocking the Planning Capabilities of LLMs Through Maximum Diversity Fine-Tuning” 
        by
       Wenjun Li, Changyu Chen and Pradeep Varakantham. NAACL, 2025. 
    
    
  
  
  
      “ Marginal Benefit Driven RL Teacher for Unsupervised Environment Design” 
        by
       Dexun Li, Wenjun Li and Pradeep Varakantham. AAAI, 2025.  Oral Presentation 
    
    
  
  
      “ Offline Safe Reinforcement Learning Using Trajectory Classification” 
        by
        Ze Gong, Akshat Kumar and Pradeep Varakantham. AAAI, 2025.  Oral Presentation 
    
    
  
  
      “ EduQate: Generating Adaptive Curricula through RMABs in Education Settings” 
        by
       Sidney Tio, Dexun Li and Pradeep Varakantham. AAMAS, 2025. 
    
    
  
  
      “ On Learning Informative Trajectory Embeddings for Imitation, Classification and Regression” 
        by
        Zichang Ge, Changyu Chen, Arunesh Sinha and Pradeep Varakantham. AAMAS, 2025. 
    
    
  
  
    2024
    
    
  
  
      “ IRL for Restless Multi-armed Bandits with Applications in Maternal and Child Health” 
        by
       Gauri Jain, Pradeep Varakantham, Aparna Taneja, Haifeng Xu, Prashant Doshi and Milind Tambe. PRICAI, 2024. Best Paper Runner-up 
    
    
  
  
      “ Improving Environment Novelty Quantification for Effective Unsupervised Environment Design” 
        by
       Jayden Teoh, Wenjun Li and  Pradeep Varakantham. NeurIPS, 2024.  Oral Presentation 
    
    
  
  
      “ Safety through feedback in Constrained RL” 
        by
       Shashank Chirra, Pradeep Varakantham and Praveen Paruchuri. NeurIPS, 2024.
    
    
  
  
      “ SPRINQL: Sub-optimal Demonstrations driven Offline Imitation Learning” 
        by
       Hoang Minh Huy, Mai Anh Tien and  Pradeep Varakantham. NeurIPS, 2024.
    
    
  
  
      “ Unsupervised Training Sequence Design: Efficient and Generalizable Agent Training” 
        by
       Wenjun Li and  Pradeep Varakantham. AAAI, 2024.
    
    
  
  
      “ Imitate the Good and Avoid the Bad: An incremental approach to Safe Reinforcement Learning” 
        by
       Minh Huy Hoang, Mai Anh Tien and  Pradeep Varakantham. AAAI, 2024.
    
    
  
  
      “ Reward Penalties on Augmented States for Solving Richly Constrained RL Effectively” 
        by
       Jiang Hao, Mai Anh Tien,  Pradeep Varakantham and Minh Huy Hoang. AAAI, 2024.
    
    
  
  
      “ Handling Long and Richly Constrained Tasks through Constrained Hierarchical Reinforcement Learning” 
        by
       Yuxiao Lu, Arunesh Sinha and  Pradeep Varakantham. AAAI, 2024.
    
    
  
  
      “ Regret-based Defense in Adversarial Reinforcement Learning” 
        by
       Roman Belaire, Pradeep Varakantham, Thanh Nguyen and David Lo. AAMAS, 2024.
    
    
  
  
      “ Imitating Cost-Constrained Behaviors in Reinforcement Learning” 
        by
       Shao Qian,  Pradeep Varakantham and Shih-fen Cheng. ICAPS, 2024.
    
    
  
  
    2023
    
    
  
  
      “ Generative Modelling of Stochastic Actions with Arbitrary Constraints in Reinforcement Learning” 
        by
        Changyu Chen, Ramesha Karunasena, Thanh Hong Nguyen, Arunesh Sinha and  Pradeep Varakantham. NeurIPS, 2023.
    
    
  
  
      “ Generalization through Diversity: Improving Unsupervised Environment Design” 
        by
       Wenjun Li,  Pradeep Varakantham and Dexun Li. IJCAI, 2023.
    
    
  
  
      “ Transferable Curricula through Difficulty Conditioned Generators ” 
       by
       Sidney Tio and  Pradeep Varakantham. IJCAI, 2023.
    
    
  
  
      “ Constrained Reinforcement Learning in Hard Exploration Problems ” 
       by
       Pankayaraj Pathmanathan and  Pradeep Varakantham. AAAI, 2023.
    
    
  
  
      “ Future Aware Pricing and Matching for Sustainable On-demand Ride Pooling ” 
        by
       Xianjie Zhang,  Pradeep Varakantham and Hao Jiang. AAAI, 2023.
    
    
  
  
      “ Knowledge Compilation for Constrained Combinatorial Action Spaces in Reinforcement Learning ” 
       by
      Jiajing Ling, Moritz Lukas Schuler, Akshat Kumar and  Pradeep Varakantham. AAMAS, 2023.
    
    
  
  
      “ Avoiding Starvation of Arms in Restless Multi-Armed Bandits” 
        by
       Dexun Li and  Pradeep Varakantham. AAMAS, 2023.
    
    
  
  
      “ Strategic Planning for Flexible Agent Availability in Large Taxi Fleets ” 
       by
       Rajiv Ranjan Kumar,  Pradeep Varakantham and Shih-fen Cheng. AAMAS, 2023. 
    
    
  
  
    2022
    
    
  
  
      “ Field Study in Deploying Restless Multi-Armed Bandits: Assisting Non-Profits in Improving Maternal and Child Health ”  by
      Aditya Mate, Lovish Madan, Aparna Taneja, Neha Madhiwalla, Shresth Verma, Gargi Singh, Aparna Hegde,  Pradeep Varakantham,  and Milind Tambe.
        AAAI, 2022.  
    
    
    
    
        “ Hierarchical Value Decomposition for Effective On-Demand Ride Pooling. 
        ”  by
         Hao Jiang and  Pradeep Varakantham. AAMAS, 2022.
      
      
    
    
        “ Joint Pricing and Matching for City-Scale Ride Pooling. 
        ”  by
         Sanket Shah, Meghna Lowalekar and  Pradeep Varakantham. ICAPS, 2022.
      
      
    
    
        “ Efficient Resource Allocation with Fairness Constraints in Restless Multi-Armed Bandits. 
        ”  by
         Dexun Li and  Pradeep Varakantham. UAI, 2022.
      
      
    
  
    2021
    
    
  
  
    “ Zone pAth Construction (ZAC) based Approaches for Effective Real-Time Ridesharing ”  by
   Meghna Lowalekar,  Pradeep Varakantham  and Patrick Jaillet.
     Journal of Artificial Intelligence Research (JAIR), 2021.   JAIR Award Track 
  
  
  
  
      “ Learn to Intervene: An Adaptive Learning Policy for Restless Bandits in Application to Preventive Healthcare  
      ”  by
      Arpita Biswas, Gaurav Aggarwal, Pradeep Varakantham and Milind Tambe. International Joint Conference on Artificial Intelligence (IJCAI), 2021.
    
    
  
  
      “ Zone pAth Construction (ZAC) based approaches for Effective Real-Time Ridesharing 
      ”  by
      Meghna Lowalekar, Pradeep Varakantham and Patrick Jaillet. International Joint Conference on Artificial Intelligence (IJCAI), 2021 Journal Track.
    
    
    
    “  CLAIM: Curriculum Learning Policy for Influence Maximization in Unknown Social Networks
    ”  by
    Dexun Li, Meghna Lowalekar and Pradeep Varakantham. Uncertainty in Artificial Intelligence (UAI) 2021.
  
  
    “  Adaptive Operating Hours for Improved Performance of Taxi Fleets
    ”  by
    Rajiv Ranjan Kumar, Pradeep Varakantham and Shih-Fen Cheng. Autonomous Agents and Multi-Agent Systems (AAMAS) 2021.
  
  
    “  Learning Index Policies for Restless Bandits with Application to Maternal Healthcare
    ”  by
    Arpita Biswas, Gaurav Aggarwal, Pradeep Varakantham and Milind Tambe. Extended Abstract at Autonomous Agents and Multi-Agent Systems (AAMAS) 2021.
  
  
  
    2020
    
    
      “ Neural Approximate Dynamic Programming for On-Demand Ride-Pooling ”  by
     Sanket Shah, Meghna Lowalekar and Pradeep Varakantham.
        Published in the Conference of Association for Advancement of AI, AAAI 2020. 
    
    
    
      “ Solving Online Threat Screening Games using Constrained Action Space Reinforcement Learning ”  by
      Sanket Shah, Arunesh Sinha,  Pradeep Varakantham, Andrew Perrault and Milind Tambe.
        Published in the Conference of Association for Advancement of AI, AAAI 2020. 
    
    
      “ Competitive Ratios for Online Multi-capacity Ridesharing ”  by
     Meghna Lowalekar, Pradeep Varakantham and Patrick Jaillet.
        Published in International Conference on Autonomous Agents and Multi-Agent Systems, AAMAS 2020.
    
    
      “ Online Traffic Signal Control through Sample Based Constraint Optimization ”  by
   Srishti Dhamija, Alolika Gon,  Pradeep Varakantham and William Yeoh.
        Published in International Conference on Automated Planning and Scheduling, ICAPS 2020.
    
    
 
 
     2019
     
     
 
 
       “ ZAC: A Zone Path Construction Approach for Effective Real-time Ridesharing ”  by
      Meghna Lowalekar, Pradeep Varakantham and Patrick Jaillet.
         Published in International Conference on Automated Planning and Scheduling, ICAPS 2019.  Best Application Paper award. 
     
     
     
 
 
       “ Resource Constrained Deep Reinforcement Learning ”  by
     Abhinav Bhatia,  Pradeep Varakantham and Akshat Kumar.
         Published in International Conference on Automated Planning and Scheduling, ICAPS 2019.
     
     
 
 
 
       “ Correlated Learning for Aggregation Systems ”  by
      Tanvi Verma and Pradeep Varakantham.
         Published in International Conference on Uncertainty in Artificial Intelligence, UAI 2019.
     
     
 
 
 
       “ Entropy based Independent Learning in Anonymous Multi-Agent Settings ”  by
    Tanvi Verma,   Pradeep Varakantham and Hoong Chuin Lau.
         Published in International Conference on Automated Planning and Scheduling, ICAPS 2019.
     
     
 
 
 
     2018
     
     
 
 
       “ Online Spatio-Temporal Matching in Stochastic and Dynamic Domains ”  by
      Meghna Lowalekar, Pradeep Varakantham and Patrick Jaillet.
         Published in Artificial Intelligence Journal (AIJ)
         
         
         
 
 
       “ Dispatch Guided Allocation Optimization for Effective Emergency Response ”  by
      Supriyo Ghosh and Pradeep Varakantham.
         Published at National Conference on Artificial Intelligence, AAAI-18
         
         
         
 
 
       “ Decentralized Planning for Non-Dedicated Agent Teams with Submodular Rewards in Uncertain Environments ”  by
      Pritee Agrawal, Pradeep Varakantham and William Yeoh.
         Accepted for publication at the conference on Uncertainty in Artificial Intelligence, UAI-18
     
     
 
 
 
       “ Bounded Rank Optimization for Effective and Efficient Emergency Response ”  by
      Pallavi Manohar, Pradeep Varakantham and Hoong Chuin Lau.
         Accepted for publication at the International Conference on Automated Planning and Scheduling, ICAPS-18
     
     
 
 
 
       “ Reserved Optimization: Handling Incident Priorities in Emergency Response Systems ”  by
      Muralidhar Konda, Supriyo Ghosh and Pradeep Varakantham.
         Accepted for publication at the International Conference on Automated Planning and Scheduling, ICAPS-18
     
     
 
 
 
       “ Upping the game of taxi driving in the age of Uber ”  by
      Shashi Shekhar Jha, Shih-Fen Cheng, Meghna Lowalekar, Nicholas Wong Wai Hin, Rishikeshan Rajendram, Tran Trong Khiem, Pradeep Varakantham, Truong Trong Nghia and Firmansyah Rahman.
         Published at the conference on Innovative Applications of Artificial Intelligence (IAAI-18)
     
     
 
 
 
    
2017
      “ Sampling based Approaches for Minimizing Regret in Uncertain Markov Decision Problems (MDPs) ”, by
 Asrar Ahmed, Pradeep Varakantham, Meghna Lowalekar, Yossiri Adulyasak and Patrick Jaillet.
    Accepted for Publication at Journal of Artificial Intelligence Research (JAIR) (PDF)
    
    
    
“Dynamic Repositioning to Reduce Lost Demand in Bike Sharing Systems ”, by
 Supriyo Ghosh, Pradeep Varakantham, Yossiri Adulyasak and Patrick Jaillet.
    Accepted for Publication at Journal of Artificial Intelligence Research (JAIR) (PDF)
    
    
    
    
    “Risk-Sensitive Stochastic Orienteering Problems for Trip Optimization in Urban Environments
        ”, by
     Pradeep Varakantham, Akshat Kumar, Hoong Chuin Lau, William Yeoh.
        Accepted for Publication at Transactions on Intelligent Systems and Technology (TIST) (PDF, Supplementary Material)
        
        
        
        
        
            
“Proactive and Reactive Coordination of Non-dedicated Agent Teams Operating in Uncertain Environments ”, by
 Pritee Agrawal and Pradeep Varakantham.
    Accepted for Publication at International Joint Conference on Artificial Intelligence, IJCAI-17(PDF)
    
    
    
    
    “Mechanism Design for Strategic Project Scheduling ”, by
     Pradeep Varakantham and Na Fu.
        Accepted for Publication at International Joint Conference on Artificial Intelligence, IJCAI-17(PDF)
        
        
        
        
“Decentralized Planning in Stochastic Environments with Submodular Rewards ”, by Rajiv Ranjan Kumar,
Pradeep Varakantham and Akshat Kumar.
In Proceedings of the AAAI Conference on Artificial
Intelligence, AAAI-17 (PDF)
“Exploiting Anonymity and Homogeneity in Factored Dec-MDPs through Pre-computed Binomial Distributions ”, by Rajiv Ranjan Kumar and
Pradeep Varakantham.
In Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS-17 (PDF)
“Incentivizing the Use of Bike Trailers for Dynamic Repositioning in Bike Sharing Systems ”, by Supriyo Ghosh and
Pradeep Varakantham.
In Proceedings of the International Conference on Automated Planning and Scheduling, ICAPS-17. (PDF)
“Augmenting Decisions of Taxi Drivers through Reinforcement Learning for Improving Revenues ”, by Tanvi Verma,
Pradeep Varakantham, Sarit Kraus and Hoong Chuin Lau.
In Proceedings of the International Conference on Automated Planning and Scheduling, ICAPS-17. (PDF)
“Online Repositioning in Bike Sharing Systems ”, by Meghna Lowalekar,
Pradeep Varakantham, Supriyo Ghosh, Sanjay Dominik Jena and Patrick Jaillet.
In Proceedings of the International Conference on Automated Planning and Scheduling, ICAPS-17. (PDF)
“Artificial Intelligence Research in Singapore: Assisting the Development of a Smart Nation ”, by
Pradeep Varakantham, Bo An, Bryan Low and Jie Zhang.
Accepted for publication in AI Magazine. (PDF)
 
2016
 |
“Sequential Decision Making for Improving Efficiency in Urban Environments ”, by
Pradeep Varakantham.
Accompanying paper for the   Early Career Spotlight   invited talk. In Proceedings of the International Joint Conference on Artificial
Intelligence, IJCAI-16 (PDF)
 |
“Scalable Greedy Algorithms for Task/Resource Constrained Multi-Agent Stochastic Planning ”, by Pritee Agrawal,
Pradeep Varakantham and William Yeoh.
In Proceedings of the International Joint Conference on Artificial
Intelligence, IJCAI-16 (PDF)
 |
“Robust Repositioning to Counter Unpredictable Demand in Bike Sharing Systems ”, by Supriyo Ghosh, Michael Trick and
Pradeep Varakantham.
In Proceedings of the International Joint Conference on Artificial
Intelligence, IJCAI-16 (PDF)
 |
“Online Spatio-Temporal Matching in Stochastic and Dynamic Domains ”, by Meghna Lowalekar,
Pradeep Varakantham and Patrick Jaillet.
In Proceedings of the AAAI Conference on Artificial
Intelligence, AAAI-16 (PDF)
 |
“A Proactive Sampling Approach to Project Scheduling under Uncertainty ”, by
Pradeep Varakantham, Na Fu and Hoong Chuin Lau.
In Proceedings of the AAAI Conference on Artificial
Intelligence, AAAI-16 (PDF)
 |
“Solving Risk Sensitive POMDPs With and Without Cost Observations ”, by Ping Hou, William Yeoh and
Pradeep Varakantham.
In Proceedings of the AAAI Conference on Artificial
Intelligence, AAAI-16 (PDF)
 |
“Robust Decision Making for Stochastic Network Design ”, by Akshat Kumar, Arambam Singh,
Pradeep Varakantham and Daniel Sheldon.
In Proceedings of the AAAI Conference on Artificial
Intelligence, AAAI-16 (PDF)
 |
“ Strategic Planning for Setting up Base Stations in Emergency Medical Systems ”, by Supriyo Ghosh and
Pradeep Varakantham.
In Proceedings of the International Conference on Automated Planning and Scheduling, ICAPS-16.(PDF) 
 |
“Robust Partial Order Schedules for RCPSP/max with Durational Uncertainty ”, by Na Fu,
Pradeep Varakantham and Hoong Chuin Lau.
In Proceedings of the International Conference on Automated Planning and Scheduling, ICAPS-16.(PDF) 
 |
“An Intelligent System for Personalized Conference Event Recommendation and Scheduling ”, by Aldy Gunawan, Hoong Chuin Lau, Pradeep Varakantham and Wenjie Wang.
In Proceedings of the conference on Prestigious Applications of Intelligent Systems, PAIS that is colocated with European Conference on Artificial Intelligence, ECAI-16. 
 |
“Robust Influence Maximization ”, by Meghna Lowalekar,
Pradeep Varakantham and Akshat Kumar.
In Proceedings of International Conference on Autonomous Agents and Multi-Agent Systems, AAMAS-16 (Short Paper). 
 |
“Detecting Communities using Coordination Games ”, by Radhika Arava and
Pradeep Varakantham.
In Proceedings of European Conference on Artificial Intelligence, ECAI-16 (Short Paper). 
 |
“NLU Framework for Voice Enabling Non-Native Applications on Smart Devices ”, by Soujanya Lanka, Deepika Pathania, Pooja Kushalappa and
Pradeep Varakantham.
In Proceedings of AAAI Conference on Artificial
Intelligence, AAAI-16 (Demonstration Paper). 
 |
“PRESS: PeRsonalized Event Scheduling recommender System (Demo Paper) ”, by Hoong Chuin Lau, Aldy Gunawan, Pradeep Varakantham and Wenjie Wang.
In Proceedings of International Conference on Autonomous Agents and Multi-Agent Systems, AAMAS-16 (Demonstration Paper). 
 |
2015
 |
"DIRECT: A Scalable Approach
for Route Guidance in Selfish Orienteering Problems", by Pradeep
Varakantham,
Hala Mostafa, Na Fu and Hoong Chuin Lau. In Proceedings of the Joint
Conference of Autonomous Agents and Multi-Agent Systems, AAMAS-15 (
PDF) 
 |
"Near-Optimal Decentralized
Power Supply Restoration in Smart Grids", by Pritee Agrawal,
Akshat Kumar and Pradeep
Varakantham. In
Proceedings of the Joint Conference of Autonomous Agents and
Multi-Agent Systems, AAMAS-15 (
PDF) 
 |
“Probabilistic Inference Based Message-Passing for
Resource Constrained DCOPs”, by Supriyo Ghosh, Akshat Kumar and
Pradeep Varakantham.
In Proceedings of the International Joint Conference on Artificial
Intelligence, IJCAI-15 (PDF)
 |
"Risk based Optimization for
Improving Emergency Medical Systems", by Sandhya Saisubramanian,
Pradeep
Varakantham and
Hoong Chuin Lau. In Proceedings of Twenty Ninth AAAI Conference on
Artificial Intelligence, AAAI-15 (
PDF) 
 |
"Solving Uncertain MDPs with
Objectives that are Separable over Instantiations of Model
Uncertainty", by Yossiri Adulyasak, Pradeep
Varakantham,
Asrar Ahmed and Patrick Jaillet. In Proceedings of Twenty Ninth AAAI
Conference on Artificial Intelligence, AAAI-15 (
PDF) 
 |
"Robust Execution Strategies
for Project Scheduling under Unreliable Resources and Stochastic
Durations", by Na Fu, Hoong Chuin Lau and Pradeep
Varakantham.
Accepted for publication in Journal of Scheduling. (PDF)
 |
"Dynamic Redeployment to
Counter Congestion or Starvation in Vehicle Sharing Systems", by
Supriyo Ghosh, Pradeep Varakantham, Yossiri Adulyasak, and Patrick
Jaillet. In Proceedings of International Conference on Automated
Planning and Scheduling, ICAPS-15(
PDF) 
 |
"An Extended Study on Addressing Defender Teamwork while Accounting for Uncertainty in Attacker Defender Games using Iterative Dec-MDPs ", by
Eric Shieh, Albert Xin Jiang, Amulya Yadav, Pradeep Varakantham and Milind Tambe. In the journal of Multiagent and Grid Systems. (
PDF) 
 |
"Incremental DCOP Search Algorithms for Solving Dynamic DCOP Problems ", by
William Yeoh, Pradeep Varakantham, Xiaoxun Sun and Sven Koenig. In Proceedings of Intelligent Agent Technology, IAT-15(
PDF) 
 |
"Learning and Controlling Network Diffusion in Dependent Cascade Models ", by
Jiali Du, Pradeep Varakantham, Akshat Kumar and Shih-Fen Cheng. In Proceedings of Intelligent Agent Technology, IAT-15(
PDF) 
 |
2014
 |
"Decentralized Stochastic
Planning with Anonymity in Interactions", by Pradeep
Varakantham,
Yossiri Adulyasak and Patrick Jaillet. In Proceedings of Twenty
Eighth AAAI Conference on Artificial Intelligence, AAAI-14,(PDF)
 |
"STREETS: Game-Theoretic
Traffic Patrolling with Exploration and Exploitation", by
Matthew Brown, Sandhya Saisubramanian, Pradeep
Varakantham and
Milind Tambe. In Proceedings of Innovative Applications in Artificial
Intelligence (IAAI) at Twenty Eighth AAAI Conference on Artificial
Intelligence, AAAI-14,(PDF)
 |
"Unleashing Dec-MDPs in
Security Games: Enabling Effective Defender Teamwork", by Eric
Shieh, Albert Jiang, Amulya Yadav, Pradeep
Varakantham and
Milind Tambe. In Proceedings of Twenty First European Conference on
Artificial Intelligence, ECAI-14,(PDF)
 |
"Revisiting Risk-Sensitive
MDPs: New Algorithms and Results", by Ping Hou, William Yeoh
and Pradeep
Varakantham. In
Proceedings of the International Conference on Automated Planning and
Scheduling (ICAPS-14). (PDF)
 |
"On Understanding Diffusion
Dynamics of Patrons at a Theme Park", by Jiali Du, Akshat Kumar
and Pradeep
Varakantham.
Extended abstract in the Proceedings of the International Conference
on Autonomous Agents and Multi-Agent Systems (AAMAS-14).(PDF)
 |
"Building THINC: User
Incentivization and Meeting Rescheduling for Energy Savings",
by Jun Young Kwak, Debarun Kar, William Haskell, Pradeep
Varakantham,
Milind Tambe. Proceedings of the International Conference on
Autonomous Agents and Multi-Agent Systems (AAMAS-14). (PDF)
 |
2013
 |
"Regret based Robust
Solutions for Uncertain Markov Decision Processes", by Asrar
Ahmed, Pradeep
Varakantham,
Yossiri Adulyasak and Patrick Jaillet. In Proceedings of the
conference on Neural Information Processing Systems (NIPS-13). (PDF,
  Supplement)
 |
"Optimization Approaches for
Solving Chance Constrained Stochastic Orienteering Problems",
by Pradeep
Varakantham and
Akshat Kumar. In Proceedings of the Conference on Algorithmic
Decision Theory (ADT-13). (PDF)
 |
"TESLA: An Extended Study of
an Energy-saving Agent that Leverages Schedule Flexibility", by
Jun Young Kwak, Pradeep
Varakantham,
Rajiv Maheswaran, Yu-Han Chang, Milind Tambe, Burcin Becerik-Gerber
and Wendy Wood. In the Journal of Autonomous Agents and Multi-Agent
Systems (JAAMAS). (PDF)
 |
"Scalable Randomized
Patrolling for Securing Rapid Transit Networks", by Pradeep
Varakantham, Lau
Hoong Chuin and Zhi Yuan. In Proceedings of the Conference on
Innovative Applications in Artificial Intelligence (IAAI) at the
National Conference on Artificial Intelligence (AAAI-13). (PDF)
 |
"TESLA: An Energy-saving
Agent that Leverages Schedule Flexibility", by Jun Young Kwak,
Pradeep
Varakantham,
Rajiv Maheswaran, Yu-Han Chang, Milind Tambe, Burcin Becerik-Gerber
and Wendy Wood. In Proceedings of the International Conference on
Autonomous Agents and Multi-Agent Systems (AAMAS-13). (PDF)
 |
"Budgeted Personalized
Incentive Approaches for Smoothing Congestion in Resource Networks",
by Pradeep
Varakantham, Fu
Na, William Yeoh, Shih-Fen Cheng and Lau Hoong Chuin. In Proceedings
of the Conference on Algorithmic Decision Theory (ADT). (PDF)
 |
"Marginal Contribution
Stochastic Games for Dynamic Resource Allocation: Bounds and
Convergence", by Archie Chapman and Pradeep
Varakantham. In
the workshop on Optimization in Multi-Agent Systems (OptMAS) at the
International Conference on Autonomous Agents and Multi-Agent
Systems (AAMAS-13). 
 |
"An Agent-Based Approach to
Dynamic Experience Management in Theme Parks", by Shih-Fen
Cheng, Larry Lin, Jiali Du, Lau Hoong Chuin and Pradeep
Varakantham. In
the Winter Simulation Conference (WSC-13). 
 |
"Interacting Knapsack Problem
in Designing Resource Bundles", by Truong Huy Nguyen, Pradeep
Varakantham, Lau
Hoong Chuin and Shih-Fen Cheng. In the Metaheuristics International
Conference (MIC-13). 
 |
2012
 |
"Lagrangian Relaxation for
Large-Scale Multi-Agent Planning", by Geoffrey J. Gordon,
Pradeep
Varakantham,
William Yeoh, Hoong Chuin Lau, Ajay S. Aravamudhan and Shih-Fen
Cheng. In Proceedings of the International Conference on Intelligent
Agent Technology (IAT-2012). (PDF)
 |
"Uncertain Congestion Games
with Assorted Human Agent Populations", by Asrar Ahmed, Pradeep
Varakantham and
Shih-Fen Cheng. Twenty Eighth International Conference on Uncertainty
in Artificial Intelligence (UAI-2012). (PDF)
(Acceptance Rate: 31%)
 |
"Dynamic Stochastic
Orienteering Problems for Risk-Aware Applications", by William
Yeoh, Lau Hoong Chuin, Pradeep
Varakantham,
Huaxing Chen and Duc Thien Nguyen. Twenty Eighth International
Conference on Uncertainty in Artificial Intelligence (UAI-2012).
(PDF)
(Acceptance Rate: 31%)
 |
"Robust Local Search for
Solving RCPSP/max with Durational Uncertainty", by Na Fu, Lau
Hoong Chuin, Pradeep
Varakantham, and
Xiao Fei. Journal of Artificial Intelligence Research (JAIR).(PDF)
 |
"Decision Support for Agent
Populations in Uncertain and Congested Environments", by Pradeep
Varakantham,
Shih-Fen Cheng, Geoff Gordon and Asrar Ahmed. Twenty Sixth Conference
on Artificial Intelligence (AAAI-2012). (PDF)
(Acceptance Rate: 26%)
 |
"Active Malware Analysis
using Stochastic Games", by Simon Williamson, Pradeep
Varakantham,
Debin Gao and Chen Hui Ong. Eleventh International Conference on
Autonomous Agents and Multi-Agent Systems (AAMAS-2012). (PDF)
(Acceptance Rate: 20%)
 |
"SAVES: A Sustainable
Multi-Agent Application to Conserve Building Energy Considering
Occupants", by Jun-Young Kwak, Pradeep
Varakantham,
Rajiv Maheswaran, Milind Tambe, Farrokh Jazizadeh, Geoffrey Kavulya,
Laura Klein, Burcin Becerik-Gerber, Timothy Hayes and Wendy Wood.
Eleventh International Conference on Autonomous Agents and
Multi-Agent Systems (AAMAS-2012). (PDF)
(Acceptance Rate: 20%)
 |
"Lagrangian Relaxation for
Large Scale Multi-Agent Planning", by Geoff Gordon, Pradeep
Varakantham,
William Yeoh, Lau Hoong Chuin, Ajay Srinivasan and Cheng Shih-Fen.
Poster paper in Eleventh International Conference on Autonomous
Agents and Multi-Agent Systems (AAMAS-2012). (PDF)
(Acceptance Rate: 22%)
 |
"Delayed Observation Planning
in Partially Observable Domains", by
Pradeep Varakantham
and Janusz Marecki. Poster paper in Eleventh International Conference
on Autonomous Agents and Multi-Agent Systems (AAMAS-2012). (PDF)
(Acceptance Rate: 22%)
 |
"Prioritized Shaping of
Models for Solving DEC-POMDPs", by Pradeep
Varakantham,
William Yeoh, Prasanna Velagapudi, Katia Sycara and Paul Scerri.
Poster paper in Eleventh International Conference on Autonomous
Agents and Multi-Agent Systems (AAMAS-2012). (PDF)
(Acceptance Rate: 22%)
 |
"Sustainable Multiagent
Application to Conserve Energy", by Jun-Young Kwak, Pradeep
Varakantham,
Rajiv Maheswaran, Milind Tambe, Farrokh Jazizadeh, Geoffrey Kavulya,
Laura Klein, Burcin Becerik-Gerber, Timothy Hayes and Wendy Wood.
Demonstration paper at Eleventh International Conference on
Autonomous Agents and Multi-Agent Systems (AAMAS-2012).
 |
"Coordinating Occupant Behavior for
Building Energy and Comfort Management using Multi-Agent Systems", by
Laura Klein, Jun-Young Kwak, Geoffrey Kavulya, Farrokh Jazizadeh,
Burcin Becerik-Gerber, Pradeep
Varakantham and
Milind Tambe. Automation in Construction: An International Research
Journal (To Appear).
 |
2011
 |
"Towards Optimal Planning for
Distributed Coordination Under Uncertainty in Energy Domains",
by Jun-young Kwak, Pradeep
Varakantham,
Milind Tambe, Laura Klein, Farrokh Jazizadeh, Geoffrey Kavulya,
Burcin B. Gerber and David J. Gerber. In AAMAS Workshop on Agent
Technologies for Energy Systems (ATES), May, 2011 
 |
"Decision Support in
Organizations: A Case for OrgPOMDPs", by Pradeep
Varakantham,
Nathan Schurr, Alan Carlin and Christopher Amato. Accepted for
publication in the proceedings of IEEE/WIC/ACM International
Conference on Intelligent Agent Technology (IAT). (PDF)
(Acceptance Rate: 21%) 
 |
"Social Model Shaping for
Solving Generic DEC-POMDPs", by Pradeep
Varakantham.
Accepted for publication in the proceedings of IEEE/WIC/ACM
International Conference on Intelligent Agent Technology (IAT). (PDF)
(Acceptance Rate: 21%) 
 |
"Distributed Model Shaping
for Scaling to Decentralized POMDPs with hundreds of agents", by
Prasanna Velagapudi, Pradeep
Varakantham,
Paul Scerri and Katia Sycara. Accepted for publication in the
proceedings of the Tenth International Joint Conference on Autonomous
Agents and MultiAgent Systems (AAMAS). (PDF)
(Acceptance Rate: 22%) 
 |
"Decentralized Decision
Support for an agent population in dynamic and uncertain domains",
by Pradeep
Varakantham,
Shih-Fen Cheng and Thi Duong Nguyen. Poster paper in proceedings of
the Tenth International Joint Conference on Autonomous Agents and
MultiAgent Systems (AAMAS). (PDF)
(Acceptance Rate: 23%)
 |
"Adaptive Decision Support
for Structured Organizations: A Case for OrgPOMDPs", by Pradeep
Varakantham,
Nathan Schurr, Alan Carlin and Christopher Amato. Poster paper in
proceedings of the Tenth International Joint Conference on Autonomous
Agents and MultiAgent Systems (AAMAS). (PDF) (Acceptance Rate: 23%) 
 |
"Incremental DCOP Search
Algorithms for Solving Dynamic DCOP Problems", by William Yeoh,
Pradeep
Varakantham,
Xiaoxun Sun and Sven Koenig. Poster paper in proceedings of the Tenth
International Joint Conference on Autonomous Agents and MultiAgent
Systems (AAMAS). (PDF) (Acceptance Rate: 23%) 
 |
2010
 |
"A Decision Theoretic
Approach to Data Leakage Prevention", by Janusz Marecki,
Mudhakar Srivastava and Pradeep
Varakantham.
2010, Proceedings of the Second IEEE International Conference on
Information Privacy, Security, Risk and Trust (PASSAT2010). (PDF)
(Acceptance Rate: 13%)
 |
"Effect of human biases on
human-agent teams", by Praveen Paruchuri, Pradeep
Varakantham,
Katia Sycara and Paul Scerri, 2010, Proceedings of the International
Conference on Intelligent Agent Technology (IAT), Toronto, Canada.
(PDF)
(Acceptance Rate: 18.8%)
 |
"Analyzing the impact of
human bias on human-agent teams in resource allocation", by
Praveen Paruchuri, Pradeep
Varakantham,
Katia Sycara, and Paul Scerri, 2010, Poster Paper in Proceedings of
Ninth International Joint Conference on Autonomous Agents and Multi
Agent Systems (AAMAS), Toronto, Canada. (PDF)
(Acceptance Rate: 18%)
 |
"Risk-Sensitive Planning in
Partially Observable Domains", by Janusz Marecki and Pradeep
Varakantham,
2010, Proceedings of the Ninth International Joint Conference on
Autonomous Agents and MultiAgent Systems (AAMAS), Toronto, Canada.
(PDF)
(Acceptance Rate: 23.9%)
 |
"Towards Finding Robust
Execution Strategies for RCPSP with Durational Uncertainty", by
Na Fu, Pradeep
Varakantham, and
Hoong Chuin Lau, 2010, Proceedings of the Twentieth International
Conference on Automated Planning and Scheduling (ICAPS). (PDF)
(Acceptance Rate: 34%)
 |
"Introducing Communication in
Dis-POMDPs with Locality of Interaction", by Makoto Tasaki,
Yuichi Yabu, Yuki Iwanari, Makoto Yokoo, Janusz Marecki, Pradeep
Varakantham,
Milind Tambe, 2010, Journal of Web Intelligence and Agent
Systems(WIAS), 2010 Vol. 8, No. 3 pp 8.
 |
2009
 |
"Caching Schemes for DCOP
Search Algorithms", by William Yeoh, Pradeep
Varakantham, and
Sven Koenig, 2009, Proceedings of the Eighth International Conference
on Autonomous Agents and Multi Agent Systems, AAMAS. Nominated
for Jay Modi Best Student Paper Award. (PDF)
(Acceptance Rate: 22%)
 |
"Exploiting Coordination
Locales in Distributed POMDPs via Social Model Shaping", by
Pradeep
Varakantham, Jun
Young Kwak, Matthew Taylor, Janusz Marecki, Paul Scerri, and Milind
Tambe. Proceedings of the Nineteenth International Conference on
Automated Planning and Scheduling (ICAPS), Thessaloniki, Greece from
Sept 19-23, 2009. (PDF)
(Acceptance Rate: 34%)
 |
2008
 |
"Linear Relaxation Techniques
for Task Management in Uncertain Settings", by Pradeep
Varakantham and
Stephen Smith, 2008, Proceedings of the Eighteenth International
Conference on Automated Planning and Scheduling, ICAPS. (PDF)
(Acceptance Rate: 31.2%)
 |
"Introducing Communication in
Dis-POMDPs with Locality of Interaction", by Makoto
Tasaki, Yuichi Yabu, Yuki Iwanari, Makoto Yokoo, Milind Tambe, Janusz
Marecki and Pradeep Varakantham,
IEEE/WIC/ACM International Conference on Intelligent Agent Technology
(IAT-2008), 2008. (PDF)
(Acceptance Rate: 19%)
 |
"Not All Agents are Equal:
Scaling up Distributed POMDPs for Agent Networks", by Tapana
Gupta, Janusz Marecki, Pradeep
Varakantham, and
Makoto Yokoo, 2008, Proceedings of the Seventh International
Conference on Autonomous Agents and MultiAgent Systems, AAMAS. (PDF)
(Acceptance Rate: 22%)
 |
"What went wrong and why",
by Milind Tambe, Emma Bowring, Jonathan Pearce, Pradeep
Varakantham,
David Pynadath, and Paul Scerri, 2008, AI Magazine Article.
 |
2004-2007
 |
"Towards efficient planning
for real world partially observable domains", by Pradeep
Varakantham,
Dissertation for
Doctor of Philosophy in
Computer Science, University of Southern California, Los Angeles, CA,
02/2007.
 |
"Letting loose a SPIDER on a
network of POMDPs: Generating quality guaranteed policies", by
Pradeep
Varakantham,
Janusz Marecki, Makoto Yokoo, and Milind Tambe, 2007. Proceedings of
the Sixth International Joint Conference on Autonomous Agents and
Multi Agent Systems, AAMAS. (PDF)
(Acceptance Rate: 22.8%)
 |
"Towards efficient
computation of quality bounded solutions in POMDPs", by Pradeep
Varakantham,
Rajiv Maheswaran, Tapana Gupta, and Milind Tambe, 2007, Proceedings
of the Twentieth International Joint Conference on Artificial
Intelligence, IJCAI. (PDF)
(Acceptance Rate: 15.7%)
 |
"Winning back the CUP for
Distributed POMDPs: Planning over continuous belief spaces", by
Pradeep
Varakantham,
Ranjit Nair, Milind Tambe, and Makoto Yokoo, 2006. Proceedings of the
Fifth International Conference on Autonomous Agents and Multi Agent
Systems, AAMAS. (PDF)
(Acceptance Rate: 23.1%)
 |
"Privacy Loss in Distributed
Constraint Reasoning: A Quantitative Framework for Analysis and its
Applications", by Rajiv Maheswaran, Jonathan Pearce, Emma
Bowring, Pradeep
Varakantham, and
Milind Tambe, 2006, Journal of Autonomous Agents and Multi-Agent
Systems, JAAMAS.
 |
"Hybrids in Multiagent
Teamwork", by Praveen Paruchuri, Emma Bowring, Ranjit Nair,
Jonathan Pearce, Nathan Schurr, Milind Tambe, Pradeep
Varakantham, and
Rajiv Maheswaran, 2006, 19-24, Communications of the Computer Society
of India. (PDF)
 |
"Exploiting Belief Bounds:
Practical POMDPs for Personal Assistant Agents", by Pradeep
Varakantham,
Rajiv Maheswaran, and Milind Tambe, 2005, Proceedings of the Fourth
International Conference on Autonomous Agents and Multi Agent
Systems, AAMAS. (PDF)
(Acceptance Rate: 24.5%)
 |
"Networked Distributed
POMDPs: A Synthesis of Distributed Constraint Optimization and
POMDPs", by Ranjit Nair, Pradeep
Varakantham,
Milind Tambe, and Makoto Yokoo, 2005, Proceedings of the Twentieth
National Conference on Artificial Intelligence, AAAI. (PDF)
(Acceptance Rate: 18.4%)
 |
"Valuations of Possible
States (VPS): A Unifying Quantitative Framework for Evaluating
Privacy in Collaboration", by Rajiv Maheswaran, Jonathan Pearce,
Pradeep
Varakantham,
Emma Bowring, and Milind Tambe, 2005, Proceedings of the Fourth
International Conference on Autonomous Agents and Multi Agent
Systems, AAMAS. (PDF)
(Acceptance Rate: 24.5%)
 |
"Conflicts in teamwork:
Hybrids to the rescue", by Milind Tambe, Emma Bowring, Hyuckchul
Jung, Gal Kaminka, Rajiv Maheswaran, Janusz Marecki, Pragnesh Modi,
Ranjit Nair, Jonathan Pearce, and Pradeep
Varakantham,
2005, Proceedings of the Fourth International Joint Conference on
Autonomous Agents and Multiagent Systems, AAMAS. (PDF)
 |
"Taking DCOP to the Real
World: Efficient Complete Solutions for Distributed Event
Scheduling", by Rajiv Maheswaran, Milind Tambe, Emma Bowring,
Jonathan Pearce, and Pradeep
Varakantham,
2004, Proceedings of the Third International Conference on Autonomous
Agents and Multi Agent Systems, AAMAS. (PDF)
(Acceptance Rate: 24.6%) 
|