<Developer/>

Alin Verma

Data Engineer

Building scalable data infrastructure for ML-powered futures

Data Engineer specializing in AWS cloud architecture, Snowflake data warehousing, and ML pipeline development. Passionate about transforming raw data into actionable intelligence through robust, scalable solutions.

Scroll
<Experience Highlights/>

Recent Work

Data Mavericks

Data Engineer

Mar 2024 - Present

AWSSnowflakeKinesisLambda

UIDAI Technology Centre

Data Science Intern

May 2023 - July 2023

PythonTensorFlowOpenCVscikit-learn

Samsung Research Institute

Research Intern

Dec 2022 - Aug 2023

Deep LearningCNNPythonPose Estimation
<Featured Work/>

Key Projects

Real-time ML Pipeline on AWS

Data Engineering

Designed and deployed a scalable real-time machine learning pipeline using AWS Kinesis, Lambda, and SageMaker for fraud detection with sub-second latency.

Processing 10K+ events per second
99.9% uptime with automated failover
40% cost reduction through optimization
AWS KinesisLambdaSageMakerS3DynamoDB

Snowflake Data Warehouse Optimization

Data Warehousing

Built and optimized a multi-terabyte data warehouse in Snowflake, implementing advanced ELT patterns and improving query performance by 60%.

60% faster query execution
Automated data quality checks
Reduced storage costs by 35%
SnowflakePythondbtAirflow

Biometric Fraud Detection System

Machine Learning

Developed CNN-based deep learning models for fingerprint fraud detection achieving 95% accuracy on imbalanced datasets.

95% detection accuracy
Handled class imbalance effectively
Production-ready deployment
TensorFlowOpenCVPythonscikit-learn

Pose Estimation Pipeline

Computer Vision

Built end-to-end deep learning pipeline for real-time pose estimation and keypoint detection using state-of-the-art CNN architectures.

Real-time inference at 30 FPS
Multi-person detection support
Optimized for edge devices
PyTorchOpenCVPythonCUDA
<Tech Arsenal/>

Skills & Expertise

AWS

AWS

Cloud-based data and machine learning infrastructure design

  • Model deployment support, scalable ingestion, and real-time processing
  • Production monitoring, reliability, and cost-aware architecture
  • Lambda, Kinesis, S3, Glue, API Gateway, EC2, RDS
Snowflake

Snowflake

Data warehousing and analytics for ML workloads

  • ELT pipelines supporting feature engineering and experimentation
  • Snowflake Cortex, AI Agents, and Snowflake Intelligence
  • Performance optimization and cost management
AI/ML

AI/ML

Machine learning and deep learning expertise

  • Text and image classification with CNN architectures
  • Data preprocessing, feature engineering, and handling imbalanced datasets
  • TensorFlow, scikit-learn, OpenCV, PyTorch
Data Engineering

Data Engineering

Building robust data infrastructure

  • ETL/ELT pipeline design and optimization
  • Real-time and batch processing systems
  • Python, SQL, Apache Spark, Airflow

Technologies I Work With

PythonSQLAWS LambdaAWS KinesisS3GlueAPI GatewaySnowflakeTensorFlowPyTorchscikit-learnOpenCVDockerGitApache SparkAirflowEC2RDSDynamoDBCloudFormationCNNDeep LearningETL/ELT