Specializing in Hierarchical Reasoning Models, Transformers, and Scalable Web Applications. Transforming complex AI research into practical solutions.
I am a Computer Science graduate student at Felician University with a strong background in Artificial Intelligence and Full Stack Development. My expertise spans building reasoning models and transformer architectures from scratch, managing IT operations, and developing React Native applications.
With experience ranging from Google Summer of Code to leading development teams, I am passionate about pushing the boundaries of what's possible with code.
Directed IT helpdesk operations and a team of 4 technicians. Managed 20+ daily tickets, achieving 95% user satisfaction and improving team accuracy by 25%.
Provided technical support by resolving 2,000+ helpdesk tickets, maintaining a 95% satisfaction rating, and helping supervise campus lab equipment and student workers.
Managed a team of 4 developers in building a React Native application. Streamlined the development process, cutting the initial launch timeline by 30%.
Developed a deep learning model for bone cancer detection using Python and TensorFlow, improving accuracy by 15%. Engineered an optimized data preprocessing pipeline that reduced image analysis time by 30%.
Associated with Mainly.ai
Engineered a brain-inspired model with coupled recurrent modules for abstract planning. Achieved SOTA performance (40.3% on ARC-AGI) with memory-efficient one-step gradient approximation.
Associated with Mainly.ai
Implemented a full Transformer using NumPy/PyTorch. Integrated multi-headed self-attention and achieved a SOTA BLEU score of 40 on English-German translation.
Associated with Mainly.ai
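The core of the multi-headed self-attention this project implements can be sketched in NumPy roughly as below. The random weight matrices stand in for learned parameters, and the head count and dimensions are illustrative, not the project's actual configuration:

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def multi_head_attention(x, n_heads, rng):
    # x: (seq_len, d_model). Random projections stand in for learned weights.
    seq_len, d_model = x.shape
    assert d_model % n_heads == 0
    d_head = d_model // n_heads
    Wq, Wk, Wv, Wo = (rng.standard_normal((d_model, d_model)) / np.sqrt(d_model)
                      for _ in range(4))
    # Project, then split the feature axis across heads: (heads, seq, d_head).
    q = (x @ Wq).reshape(seq_len, n_heads, d_head).transpose(1, 0, 2)
    k = (x @ Wk).reshape(seq_len, n_heads, d_head).transpose(1, 0, 2)
    v = (x @ Wv).reshape(seq_len, n_heads, d_head).transpose(1, 0, 2)
    # Scaled dot-product attention per head: (heads, seq, seq).
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(d_head)
    attn = softmax(scores, axis=-1)
    # Concatenate heads back to (seq, d_model) and apply the output projection.
    out = (attn @ v).transpose(1, 0, 2).reshape(seq_len, d_model)
    return out @ Wo
```

In a full Transformer this sits inside a residual block with layer normalization and a feed-forward sublayer; the sketch shows only the attention computation itself.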
Adapted the Transformer architecture for computer vision with patch embeddings and a class token. Achieved 85% top-1 accuracy on CIFAR-10/ImageNet classification.
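The patch-embedding and class-token step that adapts a Transformer to images can be sketched like this. The projection, class token, and positional embeddings are random stand-ins for learned parameters; patch size and model width are illustrative:

```python
import numpy as np

def patchify(img, patch):
    # img: (H, W, C) -> (num_patches, patch*patch*C), patches in row-major order.
    H, W, C = img.shape
    ph, pw = H // patch, W // patch
    return (img[:ph * patch, :pw * patch]
            .reshape(ph, patch, pw, patch, C)
            .transpose(0, 2, 1, 3, 4)
            .reshape(ph * pw, patch * patch * C))

def embed_with_class_token(img, patch, d_model, rng):
    # Linear patch projection plus a [CLS] token, as in ViT.
    p = patchify(img, patch)
    W = rng.standard_normal((p.shape[1], d_model)) / np.sqrt(p.shape[1])
    tokens = p @ W                               # (num_patches, d_model)
    cls = rng.standard_normal((1, d_model))      # class token, prepended
    pos = rng.standard_normal((tokens.shape[0] + 1, d_model)) * 0.02
    return np.concatenate([cls, tokens], axis=0) + pos
```

The resulting token sequence is fed to a standard Transformer encoder, and the class token's final state is used for classification.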
A collection of advanced deep learning architectures and algorithms built from scratch.
Implementation of Research Paper
Implementations of various Transformer architectures including Multi-headed attention, Transformer XL, GPT, MLP-Mixer, ViT, and Switch Transformer.
Implementation of Research Paper
Implementation of Recurrent Highway Networks with enhanced depth and sequential processing capabilities.
Implementation of Research Paper
Deep learning models utilizing Long Short-Term Memory networks for processing sequential data.
Implementation of Research Paper
Implementation of HyperLSTM - utilizing a smaller network to generate weights for a larger LSTM network.
Implementation of Research Paper
Implementation of Residual Networks to train extremely deep neural networks via shortcut connections.
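The shortcut connection at the heart of a residual block can be sketched in a few lines. This is a minimal two-layer version without batch normalization; the weights are placeholders for learned parameters:

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def residual_block(x, W1, W2):
    # y = relu(x + F(x)): the identity shortcut lets signal (and gradients)
    # bypass F entirely, which is what makes very deep stacks trainable.
    return relu(x + relu(x @ W1) @ W2)
```

A useful sanity check: if the residual branch F is zeroed out, the block reduces to the identity on non-negative inputs, so stacking blocks can never make the network strictly worse than a shallower one.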
Implementation of Research Paper
Implementation of ConvMixer, substituting convolutions for self-attention and MLP operations in vision tasks.
Implementation of Research Paper
Implementation of Capsule Networks to better model hierarchical relationships in image classification.
Implementation of Research Paper
Implementations of GAN architectures including Original GAN, Deep Convolutional GAN, Cycle GAN, Wasserstein GAN, and StyleGAN 2.
Implementation of Research Paper
Implementations of Generative Diffusion models including Denoising Diffusion Probabilistic Models (DDPM).
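The DDPM forward (noising) process has a closed form that can be sketched directly; the noise schedule below is a common linear choice and is illustrative, not necessarily the one used in this project:

```python
import numpy as np

def ddpm_forward(x0, t, betas, rng):
    # Closed-form q(x_t | x_0):
    #   x_t = sqrt(abar_t) * x0 + sqrt(1 - abar_t) * eps,  eps ~ N(0, I)
    # where abar_t is the cumulative product of (1 - beta_s) up to step t.
    alphas = 1.0 - betas
    abar = np.cumprod(alphas)[t]
    eps = rng.standard_normal(x0.shape)
    return np.sqrt(abar) * x0 + np.sqrt(1.0 - abar) * eps, eps
```

Training then amounts to teaching a network to predict `eps` from `x_t` and `t`; sampling runs the learned reverse process from pure noise.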
Implementation of Research Paper
Implementation of Sketch RNN for generating vector-based drawings using seq2seq VAEs.
Implementation of Research Paper
Implementations of Graph Attention Networks (GAT) and Graph Attention Networks v2 (GATv2).
Implementation of Research Paper
Solving games with incomplete information, such as Kuhn Poker, using Counterfactual Regret Minimization (CFR).
Implementation of Research Paper
Implementations of RL algorithms like PPO, Deep Q Networks, Prioritized Replay, and Dueling Networks.
Implementation of Research Paper
Implementation of deep learning optimizers including Adam, AMSGrad, Adam with warmup, Noam, Rectified Adam, and AdaBelief.
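As a flavor of what these optimizers look like, a single Adam update with bias correction can be sketched as follows (default hyperparameters shown; `t` is the 1-indexed step count):

```python
import numpy as np

def adam_step(theta, grad, m, v, t, lr=1e-3, b1=0.9, b2=0.999, eps=1e-8):
    # Exponential moving averages of the gradient and its square.
    m = b1 * m + (1 - b1) * grad
    v = b2 * v + (1 - b2) * grad ** 2
    # Bias correction counteracts the zero initialization of m and v.
    m_hat = m / (1 - b1 ** t)
    v_hat = v / (1 - b2 ** t)
    theta = theta - lr * m_hat / (np.sqrt(v_hat) + eps)
    return theta, m, v
```

Variants like AMSGrad, Rectified Adam, and AdaBelief modify how the second-moment term `v` is maintained or corrected, while keeping this overall update shape.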
Implementation of Research Paper
Implementations of Batch, Layer, Instance, Group, Batch-Channel Normalizations, and Weight Standardization.
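Layer normalization, the simplest of these to show in isolation, can be sketched as below: statistics are computed over the feature axis of each sample, so it behaves identically at train and test time and is independent of batch size.

```python
import numpy as np

def layer_norm(x, gamma, beta, eps=1e-5):
    # Normalize each row over its feature axis, then scale and shift.
    mu = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    return gamma * (x - mu) / np.sqrt(var + eps) + beta
```

Batch, instance, and group normalization follow the same normalize-scale-shift pattern but compute the statistics over different axes.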
Implementation of Research Paper
Implementation of Knowledge Distillation techniques to transfer knowledge to efficient models.
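The standard distillation objective blends a temperature-softened teacher-matching term with ordinary cross-entropy. A minimal sketch (the temperature `T` and mixing weight `alpha` are illustrative defaults, not this project's settings):

```python
import numpy as np

def softmax(z, T=1.0):
    z = z / T
    e = np.exp(z - z.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    # Soft term: KL divergence from the softened teacher distribution to the
    # softened student distribution, scaled by T^2 to keep gradients comparable.
    p_t = softmax(teacher_logits, T)
    p_s = softmax(student_logits, T)
    soft = np.sum(p_t * (np.log(p_t + 1e-12) - np.log(p_s + 1e-12)),
                  axis=-1).mean() * T * T
    # Hard term: standard cross-entropy against the ground-truth labels.
    log_p = np.log(softmax(student_logits) + 1e-12)
    hard = -log_p[np.arange(len(labels)), labels].mean()
    return alpha * soft + (1 - alpha) * hard
```

The soft term is what transfers the teacher's "dark knowledge": its relative probabilities over wrong classes, which one-hot labels discard.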
Implementation of Research Paper
Implementation of Adaptive Computation models like PonderNet to dynamically adjust computation steps.
Implementation of Research Paper
Utilizing Evidential Deep Learning to quantify classification uncertainty in neural networks.
Relevant Coursework: Data Science, Data Mining, Artificial Intelligence
Relevant Coursework: Deep Learning, Linear Algebra, Calculus
My latest technical writings and thoughts published on Substack.
DeepLearning.AI
Issued Jun 2020
DeepLearning.AI
Issued Apr 2025
DeepLearning.AI
Issued Apr 2025
DeepLearning.AI
Issued Mar 2025
DeepLearning.AI
Issued Mar 2025
DeepLearning.AI
Issued Feb 2025
Stanford Online
Issued Jul 2022
Stanford Online
Issued Nov 5, 2024
Grade: 98.20%
Stanford Online
Issued Nov 4, 2024
Grade: 100%
Stanford Online
Issued Oct 31, 2024
Grade: 99.60%
Amazon Web Services (AWS)
Issued Jul 2023
Google Cloud (Coursera)
Issued Sep 2022
Google Cloud (Coursera)
Issued Jun 2022
CodeRed
Issued Feb 2023
CodeRed
Issued Feb 2023
University of Michigan
Issued Jun 2020
University of Michigan
Issued Sep 26, 2022
Grade: 99.17%
University of Michigan
Issued Sep 27, 2022
Grade: 100%
University of Michigan
Issued Oct 4, 2022
Grade: 95.80%
University of Michigan
Issued Nov 2, 2022
Grade: 96.88%
University of Michigan
Issued Oct 7, 2022
Grade: 92%
Coursera
Issued Dec 2022
HackerRank
Issued Jul 2021
I'm open to opportunities in AI, Machine Learning, and Full Stack Development.