As a Principal Data Engineer, I bring over a decade of expertise in the realms of design and development, offering robust enterprise solutions powered by the Java technology stack and an array of open-source frameworks.
My professional journey has been deeply intertwined with the dynamic world of Big Data technologies, boasting 9 years of hands-on experience with an impressive suite of tools and platforms. My areas of expertise include Hadoop, Spark, Pig, HBase, Hive, Sqoop, ElasticSearch, Mongo, and more.
One of my standout accomplishments is the successful setup of Hadoop clusters using diverse distributions such as Cloudera, Hortonworks, and Amazon EMR, seamlessly integrating them into the fabric of cloud platforms like Amazon Web Services (AWS) and Google Cloud Platform (GCP).
My technical prowess extends to crafting ETL infrastructure both in standalone and distributed modes, where I’ve masterfully engineered data pipelines using an array of cutting-edge technologies and tools, all within the versatile landscape of cloud computing.
Beyond my core role, I’m an active contributor to the thriving knowledge-sharing community on Stackoverflow, where I’ve earned a reputation of 6600+ for providing insightful answers to queries spanning Java/scala, Spring, Hadoop, Spark, and other pivotal big data technologies.
My journey in the field of data engineering has been marked by a relentless pursuit of excellence, a commitment to harnessing the full potential of Big Data, and a passion for sharing knowledge with the broader tech community.