Projects

Autonomous RL-DL Agent for Realtime multiplayer SuperTuxKart Ice-Hockey

April 2024

Team: Emmanuel Rajapandian, Jean Del Rosario Pegeuro, Shubham Gupta

Resources: [Technical report]

Summary: We design an automated agent to play SuperTuxKart Ice Hockey which is a game featuring a vast state space, a diverse action space, and sparse rewards, presenting a highly formidable challenge. The objective of the agent is to maximize goal scoring in any difficulty and achieve victory in the match if possible. Our approach involves using imitation learning, combining both Behavioural Cloning and DAgger, to mimic other agents and learn the optimal strategy for playing the game. We employ REINFORCE on top of our best imitation agent to adjust the variables of the agent's policy in an approach that increases the likelihood of actions that result in higher rewards 1.e., an effective goal-scoring strategy. Our system is designed to exploit potential simplifications in this complex environment, with the ultimate aim of creating proficient players. We train a neural net model, inspired by the principles of imitation learning, to support a controller network to play ice hockey. Our system is state-based, focusing on the state of the game rather than the visual input from the player's field of view.

My contribution: Generated training dataset of over 300 episodes of agent gameplay for imitation learning, coded and trained multi-layer perceptron model using imitation learning combining Behavioural modelling and DAgger, wrote major sections of project report.




Image based vision system for a racing simulator - SuperTuxKart

February 2024

Team: Emmanuel Rajapandian

Summary: Developed an image-based vision system for a racing simulator, SuperTuxKart by leveraging deep learning using Python. The initial phase involved implementing a CNN classifier using PyTorch. Employing convolutional layers, ensured effective feature extraction for classification tasks. Through hyperparameter tuning and employing various techniques such as input normalization, residual blocks, dropout, and data augmentation, the model achieved an accuracy of 94/100 on validation data. Transitioning from image classification to dense prediction, the CNN was converted to a Fully Convoluional Network (FCN) for semantic segmentation. This FCN is designed to handle arbitrary input resolutions and outputs predictions per pixel. Employing skip and residual connections ensured accurate object detection and segmentation. Expanding the segmentation network, it was repurposed into point-based object detection. Predicting dense heatmaps of object centers enabled efficient object localization. Implementing peak extraction algorithms facilitated the identification of relevant objects such as projectiles, karts, bombs, nitro within the scene, further enhancing the system's understanding of the environment. Beyond vision processing, a low-level controller responsible for steering the racing vehicle towards a specified aim point was integrated with the image-based predictions along with control actions, to ensure smooth and efficient navigation across diverse racing tracks. Lastly, the planner model was trained to predict aim points directly from input images. Overall, the sequential culmination of these components resulted in a high-performance vision-based system for racing simulation. This project helped me learn lot about systems that showcase the superiority of deep learning in enhancing virtual game environments. Owing to UT Austin Honor Code, the model files will not be made public.

My contribution: Sole contributor.




RAG Chatbot using Falcon 7B LLM using Langchain

January 2024

Team: Emmanuel Rajapandian

Resources: [Code]

Summary: Leveraged the capabilities of LangChain, 8-bit quantized Falcon-7B LLM, Chroma DB to create a sophisticated chatbot developed and equipped with the capacity to learn from the external world through Retrieval Augmented Generation (RAG). Speed-cubing has been a part of my hobbies for the past 10 years and I decided to build the chatbot by extracting data from the official WCA website, it guidelines, regulations and other data. This dataset forms the bedrock, enabling the chatbot to respond adeptly to queries regarding the queries related to speed-cubing, its rules and guidelines. Built a functioning chatbot (notebook version for now) and RAG pipeline that can hold a conversation and provide informative responses based on the WCA data.The exploration promises a deep dive into the construction of an intelligent conversational agent.

My contribution: Sole contributor.