Jainish Savalia

Data Scientist | AI/ML Engineer

LinkedIn | GitHub

About

Highly accomplished Data Scientist with a proven track record in building scalable AI solutions, optimizing system performance, and significantly reducing infrastructure costs. Expertise spans high-performance data pipelines, vector search, automation agents, and cloud-native CI/CD, leveraging technologies like Kafka, OpenSearch, FAISS, and Kubernetes to drive efficiency and innovation. Adept at transforming complex data into actionable insights and delivering measurable impact across diverse technical domains.

Work Experience

Data Scientist 1

Genuin Codebase LLP

Jul 2023 - Jun 2025

Ahmedabad, Gujarat, IN

Led the development and optimization of scalable AI solutions and data pipelines, significantly enhancing operational efficiency and reducing infrastructure costs for Genuin Codebase LLP.

  • Optimized Kafka pipelines for PostgreSQL and OpenSearch integration, achieving a 0% reduction in data sync latency for real-time data flow.
  • Engineered high-performance vector search pipelines using FAISS and OpenSearch, decreasing query response times from 2 seconds to 300 milliseconds.
  • Automated CI/CD pipelines for deployments to Kubernetes and bare-metal servers, slashing deployment time by 80%, and migrated services to bare-metal infrastructure, cutting costs by 60%.
  • Engineered automated video ingestion pipelines, handling over 5,000 videos monthly and reducing manual workloads by 95%.
  • Developed AI-powered browser automation agents, significantly increasing user task efficiency, and implemented scalable task orchestration systems for efficient data pipeline management.

Machine Learning Intern

driveBuddyAI

Jan 2023 - Jun 2023

Ahmedabad, Gujarat, IN

Contributed to the improvement and deployment of AI/ML models and pipelines, driving significant gains in accuracy, efficiency, and safety for driveBuddyAI.

  • Improved object detection models, boosting accuracy by 50% and throughput by 20%.
  • Developed scalable PyTorch pipelines, enhancing model iteration speed and deployment efficiency by 30%.
  • Built an AI-driven collision warning system integrated with Roadside Monitoring, reducing false alerts by 35%.

Education

Computer Science and Engineering

Institute of Technology, Nirma University

8.10 CPI

Jul 2019 - May 2023

Ahmedabad, Gujarat, IN

Volunteer

Volunteer

Miroli Camp

Feb 2020 - Feb 2020

Ahmedabad, Gujarat, IN

Conducted a comprehensive community survey to assess socio-economic disparities and challenges within a rural village.

  • Surveyed over 100 residents across the village to identify daily difficulties and evaluate socio-economic disparities.
  • Collected and analyzed data to understand community needs and inform potential development initiatives.

Projects

TraceFlow: Observability for AI Agents

Jun 2025 - Jul 2024

Developed an observability solution for AI agents, encompassing data pipeline construction and a real-time monitoring dashboard.

Publications

Early-Stage Detection Model Using Deep Learning Algorithms for Parkinson's Disease Based on Handwriting Patterns

Advancements in Smart Computing and Information Security

Jan 2022

Developed a deep learning model for early-stage Parkinson's Disease detection, achieving 97.62% accuracy in classifying handwriting patterns.

Triplet loss for Chromosome Classification

Journal of Innovative Image Processing

Jan 2022

Implemented a deep learning approach using triplet loss for chromosome image classification, achieving an accuracy of 89.96%.

Skills

Programming Languages

  • Python
  • Go
  • JavaScript
  • C/C++
  • Java

Frameworks & Libraries

  • FastAPI
  • Langchain
  • React
  • PyTorch
  • NumPy
  • Pandas
  • Plotly

Tools

  • Git
  • Jira
  • Kubernetes
  • FFmpeg
  • Kafka
  • OpenSearch

Databases & Storage

  • PostgreSQL
  • MongoDB
  • Elasticsearch
  • Milvus
  • Redis
  • Aerospike

Cloud & Infrastructure

  • AWS
  • OCI
  • Bare-metal Servers
  • Docker
  • Kubernetes

Data Science & AI

  • AI Solutions
  • Machine Learning
  • Deep Learning
  • Object Detection
  • Vector Search
  • Automation Agents
  • Data Pipelines
  • Model Optimization
  • CI/CD Pipelines
  • Task Orchestration