Projects Skills Education Achievements Resume

Teras is an opensource one-of-a-kind deep learning library for Tabular data. Check it out!

I also blog about Machine Learning on medium. Check them out!

PROJECTS

Basically, Github Copilot that can run locally on a laptop.

Built a custom Gemma 2 270M variant.
Utilizing the knowledge distillation, trained the base model first on english dataset (FineWebEdu) using Gemma 2 2B as teacher using the classic next word prediction technique.
In the second training phase, used CodeGemma 2B as teacher to train the pretrained 270M base model on codecontest dataset using the Fill in the Middle technique.

Technologies Used:

JAX, Flax, and 8 H100s

Teras is a unified deep learning library for Tabular Data that aims to be your one stop for everything related to deep learing with tabular data.

Checkout the official repo, https://github.com/KhawajaAbaid/teras/

It provides state of the art layers, models and arhitectures for all purposes, be it classification, regression or even data generation and imputation using state of the art deep learning architectures.
While these state of the art architectures can be quite sophisticated, Teras, thanks to the incredible design of Keras, abstracts away all the complications and sophistication and makes it easy as ever to access those models and put them to use.

Technologies Used:

Keras, TensorFlow, PyTorch, JAX

Try this out in a browser, https://khawajaabaid.github.io/prewordict

Processed 180,000 medium articles and clustered the similar words
Generated word clouds from these clusters. Given a word cloud, user has to predict one word that best belongs in the word cloud.

Technologies Used:

Scikit-Learn (TFIDF, SVD, KMeans), WordCloud, Numpy, Pandas, Matplotlib, Kaggle

Consists of three main modules:

FPS Predictor: Predicts the expected amount of FPS for a user-specified video game on a user specified PC configuration.
Games Recommender: Recommends video games based on user specified PC configuration. Uses a simple yet working ranking algorithm. (currently only CPU and GPU)
PC Predictor: Recommends PC components based on user specified video games. Uses pretty much the same ranking algorithm but in reverse. (currently only CPU and GPU)

Technologies Used:

Keras, XGBoost, Pandas, Numpy, Django

Project Demo Video Link (which I was required to submit for completing CS50X)

Worked on Computer Vision to train Deep Learning models for Person Detection as well as Detection of Certain (similarly attired) Groups among crowd.
Developed custom algorithms for Chaos Detection and Crowd Detection

Technologies Used:

TensorFlow Object Detection API, OpenCV, Matplotlib, PyQt5, Google Colab

Scraped games info and their FPS data from the internet (Steam and other sites)
Used Random Forest algorithm to build the FPS predictor

Technologies Used:

Scikit-Learn, Pandas, Requests, Beautiful Soup

Check this out here, https://twitter.com/TweetsCloudBot

Uses Twitter API to get 100 most recent user tweets
Generate a user-customizable tweets cloud (word cloud of most frequent words) in tweets, filtering the stop words.

Technologies Used:

Twitter API, Tweepy, NLTK, WordCloud, matplotlib

EDUCATION

Master of Science in Artificial Intelligence

Rochester Insitute of Technology
In Progress. Current CGPA=4.0

Bachelor of Science in Computer Science

University of Narowal
Graduated as the topper of my computer science class!

Certificates:

Havard CS50X

SKILLS

Programming: Python, JavaScript, C, C++, C#
Machine Learning: Keras, JAX, Tensorflow, PyTorch, Scikit-Learn, XGBoost
Data Wrangling: Pandas, Numpy, Scipy, Statsmodels, NetworkX
Visualization: Matoplotlib, Seaborn
NLP: AllenNLP, NLTK, Spacy, Hugging Face
Database: SQL, MongoDB
Cloud: AWS EC2, Google Colab, Kaggle Notebooks
Web Scraping: Requests, Beautiful Soup, Selenium
Others: Django, PyQt5, Git, OpenCV, Photoshop, MS Office

ACHIEVEMENTS

Fulbright Scholarship
For Masters in Machine Learning 2024-26

Replied by Yann LeCun twice on twitter
Kaggle Expert

PROJECTS

codriver - A efficient LLM based code genration tool.

Basically, Github Copilot that can run locally on a laptop.

Technologies Used:

Teras / Deep learning library for Tabular Data

Teras is a unified deep learning library for Tabular Data that aims to be your one stop for everything related to deep learing with tabular data.

Checkout the official repo, https://github.com/KhawajaAbaid/teras/

Technologies Used:

Prewordict / A fun little nerdy game (Inspired by Wordle)

Try this out in a browser, https://khawajaabaid.github.io/prewordict

Technologies Used:

Gamer Assistant - Addressing the unaddressed needs of gamers | CS50X Project

Consists of three main modules:

Technologies Used:

PROLEAs - A Crowd Detection and Behavior Analysis System | Final Year Project

Technologies Used:

Video Games FPS Predictor (v1)

Technologies Used:

Tweets Cloud Bot (Inspired by Spotify Wrapped 2021)

Check this out here, https://twitter.com/TweetsCloudBot

Technologies Used:

EDUCATION

Master of Science in Artificial Intelligence

Rochester Insitute of Technology

Bachelor of Science in Computer Science

University of Narowal

Certificates:

Havard CS50X

SKILLS

ACHIEVEMENTS

Fulbright Scholarship

Replied by Yann LeCun twice on twitter

Kaggle Expert