Hi, I'm Siva Sakthi Velan Rajagopal

Data & AI/ML Architect | Principal Data Engineer | Cloud Modernization Expert

Siva Sakthi Velan R - Profile Photo

About Me

Data & AI/ML Architect with 12+ years of experience designing enterprise-scale Data Platforms, AI/ML ecosystems, GenAI solutions, and cloud-native architectures across AWS, GCP, and open-source technologies.

Expert in building Data Lakes, Data Mesh, real-time streaming systems, AI/ML pipelines, and LLM-based solutions using AWS Bedrock, SageMaker, Spark, Kafka, Airflow, MLflow, and modern MLOps frameworks.

Demonstrated success in cloud modernization, monolith-to-microservices transformation, data engineering at PB-scale, and leading cross-functional engineering teams of 30+ members. Strong client-facing, pre-sales, architectural governance, and delivery leadership experience.

12+
Years Experience
30+
Team Members Led
1PB+
Data Managed
90%+
Data Quality

Professional Experience

Delivery Manager

1CloudHub Oct 2023 – Present

Delivery + Data Architecture + AI/ML + GenAI + Cloud Modernization

  • Architected enterprise Data Lakes using AWS Glue, EMR, Kinesis, Lake Formation, Step Functions
  • Developed optimized PySpark ETL/ELT pipelines with 40% faster execution
  • Designed AI/ML systems: Car Damage Detection, Fire & Smoke Detection, Solar Predictive Maintenance
  • Built enterprise GenAI LLM/RAG platforms using Bedrock, Anthropic, LangChain, OpenSearch
  • Led cloud modernization: On-prem → AWS Data Lake, Monolith → Microservices (React, Spring Boot, EKS)
  • Directed 30-member cross-functional team; led pre-sales, PoCs, architecture reviews

Lead Engineer

GainInsights Solutions Jan 2016 – Oct 2023

Primary: Data Engineering | Secondary: App Dev + AI/ML

  • Built real-time & batch pipelines using BigQuery, Dataproc, Pub/Sub, Dataflow, processing 10TB+ daily
  • Improved GCP cost efficiency by 30% and query performance by 40%
  • Designed CDC frameworks using Pub/Sub for low-latency data propagation
  • Developed AI/ML solutions (Regression, Classification, Forecasting)
  • Managed 1PB+ Telecom datasets, handling 1M events/second real-time ingestion
  • Migrated legacy on-prem systems to GCP Data Lake

Python Developer

Mobiveil May 2014 – Dec 2015

Python/Java Web Development + WordPress + Early Data Engineering

  • Developed backend components, APIs, and workflow automation using Python & Java
  • Delivered multiple WordPress-based websites with custom themes and plugins
  • Optimized PySpark pipelines for analytics workloads (20% faster processing)
  • Led HDFS → S3 migration saving 60% annual infra cost

Web Developer

Redposh Private Limited May 2013 – Apr 2014

Primary: WordPress Developer — 200+ Websites Delivered

  • Designed, implemented, and deployed 200+ WordPress websites, themes, and plugins
  • Improved SEO, security, and user experience across high-traffic sites
  • Enhanced internal analytics pipelines with better partitioning strategies

Featured Projects

GenAI

GenAI Sandbox – Industry-Specific Chatbots

Built multi-industry RAG chatbots (Healthcare, Procurement, Telecom, BFSI, Manufacturing) with embeddings, semantic search, knowledge ingestion, and API-based inferencing.

AWS Bedrock Anthropic LangChain OpenSearch SageMaker
ML Platform

Data Platform – End-to-End ML Lifecycle

Architected full ML lifecycle platform: Data Ingestion → ETL/ELT → Feature Store → Training → Deployment → Monitoring using Spark, Airflow, MLflow.

Python Spark Airflow MLflow CNN XGBoost
Data Lake

Telecom Data Lake – PB-scale Analytics

Managed 1PB datasets with real-time analytics, KPI dashboards, and governed data lake layers using Spark, Hadoop, and Presto.

Spark Hadoop Presto QlikSense
FinOps

Cloud Cost Optimization Platform

Built automated FinOps insights using LLM-based summarizations and infra usage analysis with real-time optimization recommendations.

LLM FinOps AWS Analytics
Voice Bot

Contact Centre Intelligence (CCI) – Voice Bot

Built automated voice bot for Tier-1 telecom client using LLMs + speech pipelines with conversation flow, intent detection, and knowledge grounding.

GenAI LLM NLP Speech
Microservices

App Modernisation – Monolith → Microservices

Modernized legacy monolithic systems into microservices-based architecture deployed on AWS EKS using Spring Boot and React.

Spring Boot React EKS AWS

Skills & Technologies

Cloud Platforms

AWS (S3, Glue, EMR, Athena, Redshift) AWS (Kinesis, Step Functions, MWAA, Lambda) AWS (SageMaker, Bedrock, Lake Formation) GCP (BigQuery, Dataproc, Dataflow) GCP (Pub/Sub, Composer, GCS)

Big Data & Open-Source

Spark Hadoop Kafka Airflow Nifi Presto/Trino Superset Hive, HDFS

AI/ML & GenAI

MLflow SageMaker Pipelines CNN, XGBoost, Prophet LLMs, RAG, Vector DBs LangChain, AWS Bedrock OpenSearch, Pinecone/FAISS Computer Vision, NLP

Programming

Python SQL PySpark Java REST APIs React, Angular JavaScript/TypeScript

Architecture & Delivery

Data Lake/Data Mesh Streaming Analytics ETL/ELT App Modernization Microservices (Spring Boot, React) Cloud Migration Team Leadership (30+)

Get In Touch

I'm always open to discussing new projects, creative ideas, or opportunities to be part of your vision.

Location: Chennai, India

Phone: +91 9884176193