Available for Opportunities

Sindhu
Satish Kumar

Software & ML/Data Engineer. I build production-grade pipelines, ML inference systems, and scalable data infrastructure, with a focus on reliability, performance, and real-world impact.

Who I am

I'm a recent MSCS graduate from USC with 3+ years of experience building data-intensive systems, ML pipelines, and scalable software. I care deeply about building things that work in the real world. Not just in notebooks.

My background spans full-stack data engineering, deep learning research, and production software development. I've worked across distributed systems, CI/CD automation, and ML model development, always with an eye on reliability, observability, and performance.

I'm currently seeking SDE, ML Engineer, and Data roles where I can contribute to meaningful products and continue growing as an engineer.

3+years of experience
90%code coverage achieved
0.90Dice score on medical imaging
40%build time reduction

Where I've worked

May 2024 — Present
ML Researcher
University of Southern California, Los Angeles
  • Designed a scalable deep learning inference pipeline using ResNet-18 to process 2,200+ dermatological images, covering the full workflow from preprocessing through model evaluation
  • Validated and optimized model accuracy using hybrid loss functions (BCEWithLogits + Dice), achieving a 0.90 Dice score and improving reliability by 12% over baseline
  • Engineered data augmentation workflows to improve DNN inference robustness across real-world inputs
Feb 2021 — June 2023
Software Development Engineer
Sixt, India
  • Decomposed a monolithic frontend into Micro-Frontend services, enabling independent deployments that reduced build times by 40%
  • Built a telemetry and observability system to capture high-velocity user interaction data, improving search ranking algorithms and increasing engagement by 15%
  • Resolved memory and rendering bottlenecks, increasing execution speed by 16% and raising test coverage from 65% to 90% via CI/CD
  • Mentored junior engineers and standardized onboarding docs, reducing ramp-up time by 25%

Academic background

Master of Science in Computer Science
University of Southern California
Aug 2023 — May 2025
Machine Learning · Foundations of AI · NLP · Deep Learning · Information Retrieval · Database Systems
Bachelor of Computer Science
JSS Science and Technology University
Aug 2017 — May 2021
Data Structures · Algorithms · OOP · Operating Systems · Computer Architecture · Data Mining · Statistics

Things I've built

Scalable Data Pipeline — Financial Market Insights
Fault-tolerant distributed pipeline using Airflow and PySpark with automated monitoring, diagnostics, and self-healing workflows.
↓ MTTR by 20% · ↓ config overhead 40%
AirflowPySparkPostgreSQLDocker
3D CNN — Neuroimaging Predictive Modeling
End-to-end 3D CNN inference pipeline using ResNet and DenseNet with custom convolution blocks for volumetric MRI data.
0.99 AUC · MSE improved 8.63 → 8.34
PyTorchResNetTransfer Learning3D CNN
Supply Chain Optimization Pipeline
Automated cloud-integrated pipeline using GitHub Actions and BigQuery for safety stock optimization and shipment delay prediction.
95%+ service level maintained
BigQueryGitHub ActionsPythonZ-score
Transformer-Based Legal Text Summarization
Parallelized inference pipelines for Transformer models (T5, DistilBART) with domain adaptation strategies for legal documents.
↑ ROUGE scores by 15%
T5DistilBARTHuggingFaceNLP

What I work with

Languages
PythonSQLJavaScriptTypeScriptC++Shell
ML / AI
PyTorchTensorFlowHuggingFaceScikit-learn
Data & Cloud
AirflowPySparkBigQueryAWSDockerPostgreSQL
Core Skills
ML PipelinesETL/ELTCI/CDAgileTDDREST APIs

Let's connect

I'm actively looking for SDE, ML Engineer, and Data roles. If you have an opportunity or just want to chat, reach out — I'd love to connect.