// available for hire_

Engineering Intelligence That Powers the Future

> |

I build robust, scalable artificial intelligence and machine learning pipelines. With deep expertise in deep learning, LLMs, and computer vision, I architect intelligent solutions that enterprises trust.

View My Work → Download Resume

Years Experience

APIs Built

Uptime Delivered

Enterprise Projects

inference.py

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

class ModelInference:
    def __init__(self, model_name: str):
        self.tokenizer = AutoTokenizer.from_pretrained(model_name)
        self.model = AutoModelForCausalLM.from_pretrained(
            model_name, torch_dtype=torch.float16, device_map="auto"
        )

    def generate(self, prompt: str) -> str:
        inputs = self.tokenizer(prompt, return_tensors="pt").to("cuda")
        outputs = self.model.generate(**inputs, max_new_tokens=100)
        return self.tokenizer.decode(outputs[0])

PyTorch

TensorFlow

Transformers

CUDA

// about_me

Turning Complex Problems Into Intelligent Solutions

I am an AI/ML engineer with a strong foundation in deep learning and scalable model deployment. My work revolves around designing intelligent architectures, from deploying state-of-the-art LLMs to building robust data pipelines. I have a deep passion for natural language processing and Computer Vision.

Having navigated the complexities of high-scale systems in predictive analytics, Generative AI, and MLOps domains, I understand that clean engineering isn't just an aesthetic choice—it's a critical component of model performance and reliability.

LocationSan Francisco, CA

AvailabilityOpen to Offers

LanguagesPython, C++, SQL

EducationMS Computer Science

// technical_arsenal

Mastery Across the Stack

🧠

Machine Learning

PyTorch, TensorFlow, Scikit-Learn, Deep Learning.

🤖

Generative AI & LLMs

Transformers, LangChain, RAG, Prompt Engineering.

☁️

MLOps & Cloud

AWS SageMaker, MLflow, Docker, Kubernetes.

🗄️

Data Engineering

Pandas, Spark, SQL, Vector Databases.

Deep Learning & Neural Networks98%

Python & Scientific Computing97%

Generative AI & LLMs95%

MLOps & Model Deployment94%

Computer Vision & NLP92%

Data Engineering & Pipelines88%

PyTorch

TensorFlow

CUDA

HuggingFace

MLflow

FastAPI

AWS SageMaker

Pinecone

Git

🧠

AI-First Mindset

Solving complex business problems with cutting-edge ML models.

⚡

Inference Performance

Optimizing weights, quantization, and accelerating serving with GPUs.

📈

Scalability by Design

From distributed training clusters to seamless model serving architectures.

// featured_projects

Featured Work

Enterprise RAG Platform

A highly scalable Retrieval-Augmented Generation (RAG) platform delivering precise answers from millions of internal documents. Employs advanced semantic search and integrates with state-of-the-art open-source LLMs.

2M+ queries/day<200ms TTFT95% Accuracy

PythonLangChainPineconeWeaviateFastAPI

Live Demo GitHub Repo

Vision API Gateway

Real-time object detection and semantic segmentation gateway for autonomous systems. Processes high-definition video streams with sub-millisecond overhead per frame.

PyTorchYOLOv8OpenCVTensorRT

Predictive Analytics Engine

Advanced time-series forecasting system to predict logistics supply chain bottlenecks up to two weeks in advance. Built using gradient-boosted trees and deep learning models.

XGBoostPandasAWS SageMaker

DeepForge Diffusion

Custom fine-tuned stable diffusion model that generates brand-compliant marketing assets efficiently and safely, leveraging custom LoRA weights.

Stable DiffusionDiffusersLoRA

// work_history

Professional Journey

2022 – Present

Senior AI Engineer

Tech Innovations Inc.

Led the migration from legacy ML infrastructure to a modern, scalable MLOps platform on AWS.
Reduced model inference latency by 60% through aggressive quantization and TensorRT optimization.

PyTorchAWS SageMakerMLflow

2020 – 2022

Machine Learning Developer

Data Systems Co.

Built high-throughput recommendation pipelines and deployed collaborative filtering models.
Engineered features from datasets with over 100M+ rows efficiently using distributed Spark clusters.

PythonApache SparkDocker

Certifications & Milestones

☁️

AWS Certified Machine Learning

Specialty Level

🏅

Kaggle Master

Top 1% Competitor

🎓

DeepLearning.AI

Specialization Certified

⭐

500+ GitHub Stars

Open Source AI Libs

// peer_reviews

What People Say

"One of the strongest AI engineers I've worked with. Their ability to distill complex research papers into production-ready PyTorch implementations is unmatched."

Jason MillerVP of Engineering @ Tech Innovations

"Pioneered our shift to generative AI. They don't just train models; they architect AI systems that are scalable, reliable, and exceptionally accurate."

Sarah R.Head of Data @ Data Systems

Let's Build Something Extraordinary

Currently open to new opportunities, specialized freelance projects, and collaborations.

Email[email protected]

LinkedInin/yourprofile

LocationSan Francisco, CA

Open to opportunities