Summary
Overview
Work History
Education
Skills
Certification
Hobbies
Languages
Work Availability
Timeline
Hi, I’m

Satyam Agarwal

Data Engineer
Copenhagen
The way to get started is to quit talking and begin doing.
Walt Disney
Satyam Agarwal

Summary

Data Engineer having 4+ years of experience in Big Data analytics and Data Warehousing using Hadoop Eco system tools, SQOOP, Hive, Spark, Scala, Unix Shell Scripting, SQL, Azure and Oracle. to enable a better data integration ,aggregation and reporting process.

  • Experience in creating SPARK programs with use of dynamic memory allocation and in memory computation to ensure robust transformations.
  • Experience in implementing partitioning, bucketing, vectorization, cost based optimization, indexing and various optimization techniques to improve the performance of Hive queries.
  • Experience in writing shell scripts and automating the spark and hive workloads using UC4 Automic Scheduler in the production system Experience with different file formats and handling the structured and nonstructural data using HIVE.
  • Working experience in Agile and SAFe environments. Having a decent knowledge in Machine Learning and experience on several algorithms like Regression, Decision Trees and Neural Networks
  • Responsive expert experienced in monitoring database performance, troubleshooting issues and optimizing database environment. Possesses strong analytical skills, excellent problem-solving abilities, and deep understanding of database technologies and systems. Equally confident working independently and collaboratively as needed and utilizing excellent communication skills.
  • Complex problem-solver with analytical and driven mindset. Dedicated to achieving demanding development objectives according to tight schedules while producing impeccable code.

Overview

4
years of professional experience
1
Certification

Work History

Tata Consultancy Services

Data Engineer
06.2019 - Current

Job overview

  • With expertise in building Big Data and Analytics, Responsible for design, development and optimization of scalable data pipelines for various structured and unstructured data sources, building API Integrations and accelerating performance of ETL processes using Hadoop Ecosystem tools /Apache Spark
  • Responsible for gathering Business Requirements from Product Owners and creation of rules and transformations in SPARK and Scala with respect to them in order to receive targeted data requirements
  • Responsible for building unified Power BI dashboard for entire group of teams
  • Collaborated on ETL (Extract, Transform, Load) tasks, maintaining data integrity and verifying pipeline stability.
  • Developed and delivered business information solutions for more then 500 different aggregation rules for several invoicing services.
  • Gathered, defined and refined requirements, led project design and oversaw implementation.
  • Contributed to internal activities for overall process improvements, efficiencies and innovation.

Education

IIIT
Bangalore,India

Post Graduate Diploma from Machine Learning And Artificial Intelligence
09.2021

VIT
Vellore

Bachelor of Technology from Computer Science And Engineering
06.2019

University Overview

  • 8.7 GPA
  • Event Coordinator, Tech Fest, 2017 to 2018



Skills

  • Big Data Technologies: Hadoop, HDFS, YARN, Apache Hive, SQOOP, Apache Spark
  • Service Provider: Hortonworks, Cloudera
  • Programming Languages: Scala, Python
  • Databases: MySQL, Oracle
  • Data Visualization: Microsoft POWER BI
  • Version Control: Bitbucket and Anaconda
  • Operating System: Linux/Unix, Windows
  • Atlassian Suite: Jira, Confluence
  • Cloud Computing: Azure Data Factory, Azure Databricks
  • Small file handling in HIVE and Spark
  • Performance optimization in joins and queries
  • Handling different big data file formats : like Parquet, ORC and Avro

Certification

  • Machine Learning Course by Stanford University (Coursera)
  • Object-Oriented Programming by EdX

Hobbies

Gym, Listening to music. Spending time with family 

Languages

Hindi
Native language
English
Proficient
C2
German
Beginner
A1
Availability
See my work availability
Not Available
Available
monday
tuesday
wednesday
thursday
friday
saturday
sunday
morning
afternoon
evening
swipe to browse

Timeline

Data Engineer

Tata Consultancy Services
06.2019 - Current

IIIT

Post Graduate Diploma from Machine Learning And Artificial Intelligence

VIT

Bachelor of Technology from Computer Science And Engineering
Satyam AgarwalData Engineer