CV
Check the pdf version of my CV here.
Research Interests
Meta Learning / NLP / Algorithmic Game Theory / AI for Healthcare
The list is by means not exhaustive, as I am always open to dabble into new areas 🚀.
Education
- B.Tech., Electronics and Communication Engineering, IIITN
- GPA: 8.7/10 (Rank: 5)
- Duration: July’17 - May’21 (4 Years)
- B.Tech.: Bachelors of Technology
- IIITN: Indian Institute of Information Technology, Nagpur.
Research Experience
- Computing Systems Lab, Tata Research & Innovation Labs, Mumbai
- Topic: Accelerating Gradient Based Meta Learners
- Guide: Dr. Rekha Singhal, Principle Scientist & Head of Lab
- Duration: 6 months (May’20 - Nov’20)
- Optimized system and model performance of a RNN based meta learner, thus achieving a speed up of 4x on training.
- Contributed to an open-source meta learnning library to generalize performance optimization achieved on MAML, ANIL, etc. Additionally, incorporated Data Parallel & Distributed settings.
- Health Language Processing Lab, University of Pennsylvania
- Topic: Multi-task learning on multi-corpus health dataset
- Guide: Dr. Graciela Gonzalez-Hernandez, Associate Professor
- Duration: 10 months (Jul’20 - Sept’20), (Jan’21 - Aug’21)
- Implemented and improved on existing algorithms in literature to achieve SOTA performance for extraction of health-related information (eg., age, gender, location, symptoms, etc.) using Multi-task learning from multi-corpus twitter health dataset.
- Participated in WNUT’20 shared task, where our system ranked 13th all over the world. The task was to classify COVID-19 tweets as informative or not information. DOI
- Game Theory Lab, Indian Insitute of Technology, Gandhinagar
- Topic: Multiwinner voting with Admissible Sets
- Guide: Dr. Neeldhara Misra, Assistant Professor
- Duration: 5 months (Nov’18 - Jan’19, May’19 - July’19)
- Contributed a lemma to reduce the NP-Hard problem of winner determination for connected property of candidate graph for NSAV/NAV scoring functions to polynomial time solvable for a fixed size of voters. [Lemma]
- Performed extensive research in the fields of Fair Division, Voting and studied to hardness of various problems. [Research Notes: Voting, Fair Division]
Publications
Undergraduate Thesis
- Topic: Geolocation Extraction of Twitter users
- Guide: Dr. Graciela Gonzalez-Hernandez, Dr. Arjun Magge
- Overview: Non-availability of well annotated and balanced datasets is considered as one of the major hurdles in analysing and extracting meaningful information from health-related tweets. In the past few years, the importance of geolocation systems have increased in health care applications, advertisments, natural disaster management, etc. Herein, we present a system which makes use of the location metadata such as geo, place and profile tags; and make use of user tweets to predict the structured location of a given Twitter user. Our system is based on the previous systems Carmen and Pigeo, and we overcome some of the drawbacks present in these systems while retaining the advantages of these systems. Furthermore, we make use of FasTText and Transformer models to build a text classifier for classifying between states in a particular country. The dataset for the same is prepared by making use of Tweepy.
- Thesis/Paper: to be updated soon …
- Additional Material: Code / Poster / Presentation (Progress) / Presentation (Final)
Projects
- Tree MAML & Hierarchical MT-NET (ongoing work)
- Guide: Sébastien Arnold, Dr. Tausif Diwan
- Motivation: Build efficient algorithms that surpass SOTA performance on existing CV benchmark datasets.
- (to be updated soon)
- Metabolts: A library for high-end research in Meta Learning
- Guide: Sébastien Arnold
- Metabolts is a software library for meta-learning research. It builds on top of PyTorch Lightning to accelerate two aspects of meta-learning research cycle: fast-prototyping and correct reproducibility. We benchmarked results on MAML, ANIL, ProtoNets and MetaOPTs.
- Code Link: Github Link
- WNUT 2020 Shared Task 2 : Classification of COVID Tweets
- Guide: Dr. Graciela Gonzalez-Hernandez, Dr. Arjun Magge
- Proposed and designed a model based on BERT Transformer which included prior annotation and preprocessing of dataset to get a F1-score of 0.966. Made use of Flair framework for additional performance gain. Our model ranked $13^{th}$ accuracy-wise in the shared task which saw hundreds of participating models.
Teaching Assistanship
Conferences & Summer Schools Attented
- 38th IARCS Annual Conference on FSTTCS’18
- 10th December 2018 - 14th December 2018
- ACM-W Summer School on Algorithmic Game Theory
- 2th July 2019 - 14th July 2019
- CSA Summer School, IISC
- 15th July 2019 - 20th July 2019
- Annual Conference of the NAACL’21 (NAACL-HLT)
- 6th June 2021 - 11th June 2021
Achievements and leadership
- Bagged best research intern diary award at Summer Research Internship Program, IIT Gandhinagar.
- Received honorable mention for the project “Paraclate” in NASA Space Settlement Contest, USA.
- My team was in top 3 performing teams for 2 of the 3 shared tasks at SMM4H’21 Workshop.
- I was a student placement coordinator at my institute for 1.5 years. I mainly dealt with establishing network with companies, and maintaining database of upcoming internships/campus drives.
- Activities at Undergraduate Institute:
- I was a part of the founding committee of IIITN’s coding club community dotSlash (./).
- I was the co-head of the Line Follower event in IIITN’s technical fest (Tantrafiesta).
- I was the head for Samarth event in IIITN’s cultural fest (Abhivyakti). Samarth is a theme event of Abhivyakti that seeks to spread awareness about the physically challenged children in India.
Technical Skills
- Programming Languages C | Python | HDL
- Frameworks Flair | KTrain | learn2learn | HuggingFace
- Libraries Keras | Tensorflow 2.0 | PyTorch
- Systems/Platforms Git | Bash | Linux | CentOS
- Markup Templating HTML | CSS | Jekyll
- Data Visualization Seaborn | Matplotlib
- Miscellaneous Softwares/Tools Latex | MATLAB | RPi | CADFEKO