Projects

Projects - Professional /Internships

At Tiger Analytics as ML Engineer

Systrans AI Assistant

Developed an Agentic RAG-based chatbot to answer user queries related to a billing guide by orchestrating multiple agents including a billing guide agent, SQL agent for dynamic data retrieval, a summarizer, and a manager to oversee task coordination.
Architected and implemented an asynchronous SQL data layer to efficiently persist chat histories and ensure scalability.
Designed and developed RESTful APIs to support chat interactions and database operations.
Built a Phoenix dashboard to enable real-time observability of the chatbot’s operations.
Contributed to scoping and setting up Azure infrastructure to support deployment, scaling, and monitoring of the chatbot.

Bug Risk Prediction

Deployed a model to predict out-of-sprint bugs using Jira data.
The model consumes features such as past bug prevalence in an epic, assignee’s past performance, and word-level features of the story description.
Used Jenkins to schedule a job that runs the model periodically (everyday in offshore hours) to score stories that are in "ready" status. The results are stored in a database and visualized through a Grafana dashboard.
Computed SHAP values along with predictions to help scrum masters identify key factors contributing to bug risk.
Developed a retraining pipeline scheduled to run periodically, updating the model with new data.
Developed a performance monitoring pipeline scheduled to run periodically to compute the model’s performance on updated data.

Jira/Epic Story Generation

Contributed to the development of a RAG-based system that automatically generates Jira stories and epics based on provided descriptions.
Leveraged OpenAI APIs to extract relevant context and formulate structured story and epic outputs.
Ensured the generated outputs were aligned with project requirements to reduce manual effort in story creation.

Confluence Chatbot

Contributed to the development of a chatbot that answers user queries regarding the company’s internal policies by leveraging RAG techniques to retrieve relevant information from Confluence pages.
Developed the chatbot interface using Chainlit to enable a seamless conversational experience.
Worked on improving retrieval relevance and chatbot responsiveness to enhance overall user experience.

At Decimal Point Analytics as Data Scientist

CSV Agent

Developed a CSV agent utilizing large language models to facilitate natural language interaction with Excel files.
Designed the agent to understand user queries and generate goals based on their input. The agent writes code to execute actions corresponding to the defined goals, allowing seamless interaction with CSV data.

Dolat Summarization

Contributed to the fine-tuning of a large language model on consumer GPUs to generate summarizations of earnings calls.
Leveraged advanced techniques, such as LoRA, to optimize the training process for large models on consumer-grade hardware.
Collaborated with a team to work with state-of-the-art models like Pythia, MPT, GPT-Neo-X, and T5 Flan, achieving significant progress in the project.

Semantic Datatype Detection

Designed and developed a high-accuracy transformer model for automated classification of column data types, leveraging a dataset of 800K data points from both online and internal sources.
Achieved an impressive 94% accuracy on the test set, demonstrating the model’s robustness and effectiveness. In addition, trained a LSTM model with an embedded layer to learn character embeddings, which achieved a solid accuracy of 85%.
Conducted rigorous experimentation and analysis to optimize model performance and improve accuracy.

ESG Classifier

Worked on a project with the objective of developing a text classifier capable of accurately categorizing text into three distinct genres: environmental, social, and governance.
Incorporated the GradCam technique to visualize the specific phrases that the BERT model focused on during the classification process, enhancing interpretability and transparency of the final output.

Tiger automation

Worked on a project to automate the appraisal process by developing a robust data processing pipeline using PySpark.
Leveraged Snowflake database as the primary data source and performed all calculations and transformations using PySpark's distributed processing capabilities.

PDF2Excel

Worked on development of a cutting-edge ML pipeline for converting PDF tables to Excel, utilizing OCR and advanced layout detection models. Leveraged DiT (document image transformer) to accurately detect the layout of PDFs and extract tabular data.

AutoML

Working on an end-to-end pipeline that automates diverse ML processes, including preprocessing, feature selection, model selection, and hyperparameter tuning.
Developed with a vision to democratize model training, the product aims to empower individuals with non-technical backgrounds, making it effortless and accessible for them to engage in machine learning.

At SchoolHack as AI/ML Engineer

IQL for ChatGPT

Used Implicit Q learning to pick up engaging answers from ChatGPT
Automated IQL finetuning pipelines using AWS EC2 and AWS S3 bucket
The IQL is performed periodically just by calling an API and this API is responsible to pull data from S3, finetune it using IQL on an ec2 instance and update the weights

Live Translation from English to other foreign Langugaes

Used seamless-m4t model (developed by Meta) to translate ChatGPT/llama's replies into other foreign languages, mainly Arabic.
Seamless-m4t requires the label for input language to do the translation. Hence, a separate language detector, xlm-roberta is deployed alongside.
To handle the traffic, the model is deployed on two g4dn xlarge GPUs using AWS sagemaker service.

Llama2 and Llama3 Finetuning

As a leading LLM application, School Hack generated a lot of data everyday.
Finetuned Llama2 7b and Llama3 8b models on a 1M chat datapoints
The training is done on G5.48xlarge instance on AWS.

At Decimal Point Analytics as Software Development Intern

Blockchain based Chat and Bid Application

Designed and developed a cutting-edge blockchain-based chat and bidding application, leveraging the power of Hedera Hashgraph.
Utilized Hedera Hashgraph's JavaScript API to build a secure and robust application that enabled seamless messaging and bidding transactions.

Projects - Academic/Personal

Automation of Cleaning cervical data using deep learning techniques:

Developed a supervised contrastive model to filter outliers in a cervical image dataset resulted in superior performance when compared to human cleaning. These impressive findings were published in the prestigious IEEE Access journal.
EfficientCenterDet: A novel Self supervision boosted RoI proposal network for cervix type detection [code]:

A fully automated self-supervised pipeline has been developed for the detection of cervical cancer. This impressive feat was achieved by leveraging a novel object detector, which drew inspiration from both the efficientdet architecture and centrenet loss. These impressive findings were published in the prestigious International Journal of Imaging Systems and Technology.
Covid-19 detection from CT scans [code]:

I successfully designed and implemented an advanced EfficientNet architecture that accurately predicts Covid-19 infection through CT scans. To ensure optimal performance, I employed a BCD U-net for efficient segmentation of the region of interest. These findings were communicate to a conference.
Cassava Leaf disease classification [code]:

I undertook a challenging Kaggle competition by implementing a variety of advanced models, including Vision Transformer, EfficientNets, and ResNets, all trained using Bi-Tempered Loss. To achieve even greater accuracy, I utilized an ensemble of these models in conjunction with Test Time Augmentation (TTA).
Stock Market prediction with tabnet [code]:

Successfully trained tabnet architecture, (original developed by google AI cloud) for regressing over a complex tabular data. Along with tabnet, I also trained gradient boosted tree algorithms like xgboost, catboost. Also, trained RNN for puts call ratio from historical data. I leveraged self supervised methods to handle missing values and ensure the highest level of model accuracy.
Tweet Sentiment Extraction [code]: Worked on a project to extract key phrases given the sentiment from tweets, utilizing multiple advanced transformers, including XLNet, RoBERTa, and alBERT. To achieve even greater performance, I implemented an ensemble of these models, further enhancing my model’s predictive power
Human Activity Recognition using 2D pose [code]: For this project, I tackled the challenging task of detecting human activities from video data. To achieve this, I utilized the powerful pose recognition model, Posenet, as a starting point, and built a custom Convlstm head on top of it. This model was then fine-tuned using a data input of 20 frames at a time, allowing for greater accuracy in activity detection.
Multi task learning for self driving cars:

Developed a single neural network that can perform object detection, segmentation and depth perception using IDD dataset

Roles, Responsibilities and Interests

Professional Level:

Domain Knowledge:

Machine Learning
Deep Learning
Artificial Intelligence
Cloud Computing

Responsibilities:

Exploration of SOTA Models
Research Paper reading
Approaches development for the use case
Helping SA’s in Architecture Development
Model Development Research
Model Development, Training, and Evaluation
Model Optimisation (Quantization and Pruning)
Model Deployment
Integrating model deployments with other cloud components
Pipeline regular maintenance metadata scripts

Personal (Passion) level:

Interests:

NLP (LLMs)
Machine learning - Tradition + Deep Learning
Reinforcement learning
Explainable AI