Teras is an opensource one-of-a-kind deep learning library for Tabular data. Check it out!
I also blog about Machine Learning on medium. Check them out!
PROJECTS
Basically, Github Copilot that can run locally on a laptop.
- Built a custom Gemma 2 270M variant.
- Utilizing the knowledge distillation, trained the base model first on english dataset (FineWebEdu) using Gemma 2 2B as teacher using the classic next word prediction technique.
- In the second training phase, used CodeGemma 2B as teacher to train the pretrained 270M base model on codecontest dataset using the Fill in the Middle technique.
Technologies Used:
JAX, Flax, and 8 H100s- It provides state of the art layers, models and arhitectures for all purposes, be it classification, regression or even data generation and imputation using state of the art deep learning architectures.
- While these state of the art architectures can be quite sophisticated, Teras, thanks to the incredible design of Keras, abstracts away all the complications and sophistication and makes it easy as ever to access those models and put them to use.
Teras is a unified deep learning library for Tabular Data that aims to be your one stop for everything related to deep learing with tabular data.
Checkout the official repo, https://github.com/KhawajaAbaid/teras/
Technologies Used:
Keras, TensorFlow, PyTorch, JAX- Processed 180,000 medium articles and clustered the similar words
- Generated word clouds from these clusters. Given a word cloud, user has to predict one word that best belongs in the word cloud.
Try this out in a browser, https://khawajaabaid.github.io/prewordict
Technologies Used:
Scikit-Learn (TFIDF, SVD, KMeans), WordCloud, Numpy, Pandas, Matplotlib, KaggleConsists of three main modules:
- FPS Predictor: Predicts the expected amount of FPS for a user-specified video game on a user specified PC configuration.
- Games Recommender: Recommends video games based on user specified PC configuration. Uses a simple yet working ranking algorithm. (currently only CPU and GPU)
- PC Predictor: Recommends PC components based on user specified video games. Uses pretty much the same ranking algorithm but in reverse. (currently only CPU and GPU)
Technologies Used:
Keras, XGBoost, Pandas, Numpy, DjangoProject Demo Video Link (which I was required to submit for completing CS50X)
- Worked on Computer Vision to train Deep Learning models for Person Detection as well as Detection of Certain (similarly attired) Groups among crowd.
- Developed custom algorithms for Chaos Detection and Crowd Detection
Technologies Used:
TensorFlow Object Detection API, OpenCV, Matplotlib, PyQt5, Google Colab- Scraped games info and their FPS data from the internet (Steam and other sites)
- Used Random Forest algorithm to build the FPS predictor
Technologies Used:
Scikit-Learn, Pandas, Requests, Beautiful SoupCheck this out here, https://twitter.com/TweetsCloudBot
- Uses Twitter API to get 100 most recent user tweets
- Generate a user-customizable tweets cloud (word cloud of most frequent words) in tweets, filtering the stop words.
Technologies Used:
Twitter API, Tweepy, NLTK, WordCloud, matplotlibEDUCATION
-
Master of Science in Artificial Intelligence
Rochester Insitute of Technology
In Progress. Current CGPA=4.0 -
Bachelor of Science in Computer Science
University of Narowal
Graduated as the topper of my computer science class!
Certificates:
-
Havard CS50X
SKILLS
- Programming: Python, JavaScript, C, C++, C#
- Machine Learning: Keras, JAX, Tensorflow, PyTorch, Scikit-Learn, XGBoost
- Data Wrangling: Pandas, Numpy, Scipy, Statsmodels, NetworkX
- Visualization: Matoplotlib, Seaborn
- NLP: AllenNLP, NLTK, Spacy, Hugging Face
- Database: SQL, MongoDB
- Cloud: AWS EC2, Google Colab, Kaggle Notebooks
- Web Scraping: Requests, Beautiful Soup, Selenium
- Others: Django, PyQt5, Git, OpenCV, Photoshop, MS Office
ACHIEVEMENTS
-
Fulbright Scholarship
For Masters in Machine Learning 2024-26 Replied by Yann LeCun twice on twitter
Kaggle Expert