Vivek Sai Madhav Suri

North Reading

Summary

Results-driven Data Engineering and Data Science professional with 5.8+ years of experience designing, building, and optimizing data pipelines, analytics solutions, and financial reporting systems across diverse platforms. Skilled in collecting, cleaning, and transforming large structured and unstructured datasets from SQL Server, MySQL, PostgreSQL, and cloud platforms for advanced analytics. Designed, trained, and deployed machine learning models, including classification, regression, clustering, and deep learning, using Python, R, SQL, and PySpark. Developed NLP solutions leveraging LLMs for sentiment analysis, chatbot integration, and text summarization, improving customer service efficiency. Built interactive dashboards in Power BI and Tableau to uncover trends, KPIs, and growth opportunities. Experienced in managing cloud-based data workflows and pipelines on Azure and AWS using Databricks, Snowflake, Azure Data Factory, and SSIS, ensuring scalable and secure operations. Collaborated with cross-functional teams to align analytics solutions with business objectives, and implemented CI/CD pipelines for automated testing and deployment. Applied emerging AI/ML techniques to enhance prediction accuracy, automation, and operational efficiency, while maintaining compliance with data governance and ethical AI standards. Successfully delivered end-to-end data initiatives, automated reporting pipelines, and scalable cloud solutions that improved business performance.

Overview

6 years of professional experience
1 Certification

Work History

Data Engineer

AXA XL
Connecticut
12.2023 - Current
  • Designed, built, and optimized data pipelines, automated reporting processes, and transformed large structured and unstructured datasets to provide actionable insights for real estate and facilities operations.
  • Designed, developed, and optimized scalable ETL/ELT pipelines for structured and unstructured data using SQL Server, MySQL, PostgreSQL, and cloud platforms (Azure and AWS).
  • Built and maintained data lakes and data warehouses leveraging Databricks, Snowflake, and Azure Data Factory to support analytics and BI reporting.
  • Automated ingestion, transformation, and integration workflows using SSIS, PySpark, and cloud-native services, ensuring data quality, governance, and reliability.
  • Collaborated with cross-functional teams, including BI developers, data scientists, and business stakeholders, to deliver high-performance data solutions aligned with business goals.
  • Designed and implemented data models, partitioning strategies, and performance tuning for large-scale databases to optimize query speed and reduce latency.
  • Developed real-time and batch data processing frameworks, enabling streaming analytics and timely decision-making.
  • Monitored, troubleshot, and improved pipeline reliability, ensuring compliance with data security, privacy, and governance standards.
  • Contributed to cloud migration initiatives and modernization of legacy systems to scalable, cloud-first data architectures.
  • Researched and adopted emerging technologies in data engineering, big data, and cloud ecosystems to improve scalability, automation, and efficiency.
  • Implemented and maintained version control and CI/CD pipelines (using Git, Jenkins, and TFS) to ensure code quality, facilitate collaboration, and streamline deployments.
  • AXA XL, a global leader in property, casualty, and specialty risk solutions, leverages data-driven insights and innovative technology to help businesses manage complex risks across diverse industries.

BI / DWH Developer

Evergent Technologies
Hyderabad
05.2021 - 09.2023

Client 1: Simple TV – Virginia, USA

Project: Simple TV 64-bit – Internet TV player application for managing playlists, recording streams, playing multimedia, and maintaining user-level usage data.

Client 2: Etisalat – Dubai, UAE

Project: Etisalat TV – Streaming application delivering English, Arabic, and regional content across smart devices, providing telecom providers with comprehensive TV services and VOD content.

Responsibilities:

  • Designed and implemented scalable ETL pipelines using SSIS, Pentaho BI, Azure Data Factory, Snowflake, and Azure Databricks, integrating data from SQL Server, MySQL, and PostgreSQL; automated data delivery via WinSCP, SFTP, Amazon S3, and ADLS.
  • Built and automated financial/banking reports including Accruals, Deferrals, Invoices, Ledgers, Refunds, Chargebacks, Reconciliation, and Journal Reports using SSRS, Excel, and Power BI, improving reporting efficiency and accuracy.
  • Designed and maintained CI/CD pipelines using Jenkins and GitHub Actions for automated testing, version control, and deployment of Python-based ETL and analytics modules.
  • Developed interactive Power BI dashboards providing real-time insights on B2B/B2C payments, commissions, credit/debit notes, and outstanding balances. Collaborated with cross-functional teams to integrate LLM-based NLP pipelines for automating customer support responses and sentiment analysis within these dashboards.
  • Managed cloud-based data workflows in AWS (EC2, S3) and Azure (SQL Database, ADLS), ensuring secure, scalable infrastructure and optimized data operations.
  • Engineered big data pipelines using Azure Databricks, PySpark, and Scala to process and transform large datasets, improving performance and enabling advanced analytics at scale.
  • Applied machine learning techniques such as regression, classification, clustering, and deep learning using Python, R, and SQL to drive predictive analytics and business automation initiatives.
  • Wrote automation scripts in T-SQL, SQL, Python, R, Java, and C across Windows and macOS environments to support robust data processing, model development, and system integration.

Software Associate

Vipra Infotech
Hyderabad
02.2020 - 04.2021
  • Developed and maintained backend modules for internal tools and enterprise applications using Java, JDBC, and SQL, ensuring high performance and data integrity.
  • Designed and optimized SQL queries, stored procedures, and triggers for data extraction, transformation, and reporting needs.
  • Built Java-based automation scripts to streamline testing workflows, reducing manual validation time and improving release efficiency.
  • Participated in all phases of the SDLC, including requirements gathering, system design, coding, unit testing, integration testing, and deployment in Agile environments.
  • Collaborated with QA and DevOps teams to debug issues, validate test cases, and ensure smooth application releases.
  • Authored technical documentation, ER diagrams, and user manuals to support system handovers and improve team knowledge sharing.
  • Applied database design principles to enhance schema performance and query execution, laying a foundation for scalable data solutions.
  • Contributed to cross-functional discussions, aligning development work with business objectives and long-term data management strategies.

Education

Master of Science - Data Science and Analytics

New England College
Henniker, NH
01.2025

Bachelor of Science - Electronics and Communications Engineering

JNTU
Kakinada, AP, India
05.2020

Skills

  • Python
  • R
  • SQL
  • T-SQL
  • C
  • Java
  • PySpark
  • Scala
  • Regression
  • Classification
  • Clustering
  • Ensemble Learning
  • Deep Learning
  • NLP
  • Large Language Models (LLMs)
  • SQL Server
  • MySQL
  • PostgreSQL
  • Snowflake
  • Azure SQL Database
  • Azure SQL Data Warehouse
  • Azure Data Lake Storage (ADLS)
  • AWS (EC2, S3)
  • Azure (ADLS, Data Factory, Databricks, SQL)
  • SFTP
  • WinSCP
  • CI/CD pipelines (Jenkins, GitHub Actions)
  • Power BI
  • SSRS
  • SSIS
  • Pentaho BI
  • Tableau
  • Excel
  • Interactive Dashboard Development
  • Windows
  • macOS
  • Linux (basic)

Certification

  • Core Java Certification
  • C Programming Certification
  • Seminar: Cryptocurrency & Blockchain
  • Project: IoT-based Password Door Lock System using AWS

Timeline

Data Engineer

AXA XL
12.2023 - Current

BI / DWH Developer

Evergent Technologies
05.2021 - 09.2023

Software Associate

Vipra Infotech
02.2020 - 04.2021

Master of Science - Data Science and Analytics

New England College

Bachelor of Science - Electronics and Communications Engineering

JNTU