Open to Cloud & Data Roles

Hello, I'm

JankiRana

AWS Certified | ETL Developer | Cloud Architect

AWS Certified Cloud Architect & ETL Developer. Building scalable cloud infrastructure and data pipelines with Terraform, Kubernetes & AWS.

Profile

Janki Rana

Cloud Data Engineer

📍 Ontario, Canada

5+

Years Experience

1

Certifications

AWS & Azure

Cloud Platforms

+40% faster

Data Processing

Scroll

01. About

About Me

Results-driven Cloud & Data Engineer with 3+ years of ETL experience at Infosys and TCS, and hands-on cloud expertise using AWS, Azure, Terraform, and Kubernetes. AWS Certified Solutions Architect and Azure Data Engineer Associate. Passionate about designing scalable, automated cloud infrastructure and high-performance data pipelines.

Based in Ontario, Canada, I bring a unique combination of cloud engineering expertise, ETL development experience, and leadership skills from managing customer service operations at Walmart Canada.

Certifications

☁️

AWS Certified Solutions Architect

Amazon Web Services

Education

Graduate Certification — Cloud Architecture & Administration

Jan2023 - Aug2023

Seneca College, Ontario

3.8 GPA

Graduate Certification — Big Data Solutions Architecture

Jan2022 – Aug2023

Conestoga College, Ontario

3.65 GPA

Bachelor of Engineering — Computer Science

2018

Gujarat Technological University

9.22 CGPA

02. Skills

Tech Stack

Cloud & Infrastructure

AWS (S3, Glue, EC2, EKS, CloudWatch)Microsoft AzureTerraformAnsibleKubernetesDockerJenkins

Data & ETL

Apache SparkHadoop (HDFS, Hive)KafkaIBM DataStagePySparkETL Pipeline DesignAWS Glue

Languages & Databases

PythonSQLScalaShell / BashPostgreSQLMongoDBDynamoDBCassandra

Analytics & Tools

TableauPower BIMicrosoft ExcelJiraServiceNowGitHub ActionsSelenium

03. Experience

Work History

Senior Data Engineer

Senior

Wipro Limited

Oct 2023 – PresentOntario, Canada
  • Designed and developed scalable ETL pipelines using AWS Glue and PySpark, processing 500GB+ of data daily
  • Built and optimized data lake architecture on AWS S3 with Athena, improving query performance by 40%
  • Developed batch and near real-time data pipelines integrating Kafka, AWS Lambda, and Redshift
  • Implemented efficient data partitioning and indexing strategies, reducing processing time significantly
  • Migrated legacy ETL workflows from IBM DataStage to AWS Glue, improving throughput by 35%
  • Automated infrastructure provisioning using Terraform, reducing deployment time by 50%
  • Optimized Spark jobs by tuning memory usage, parallelism, and data skew handling
  • Ensured data quality and validation using Python-based frameworks
  • Collaborated with data analysts and business teams to design scalable data models

Data Engineer

Technical

Infosys Limited

Jan 2021 – Dec 2021Pune, Maharashtra, India
  • Engineered ETL pipelines using Apache Spark & Python — data processing speed +40%
  • Integrated workflows with Spark on Hadoop (HDFS, Hive) — query times reduced 35%
  • Automated data transformation tasks, cutting manual intervention by 50%
  • Collaborated with data architects to design scalable ETL solutions — +20% data accessibility
  • Applied advanced data cleansing with Python to ensure high-quality datasets

Junior Data Engineer

Technical

Tata Consultancy Services

Dec 2018 – Dec 2020Pune, Maharashtra, India
  • Improved system performance by 25% via technical reviews and updates
  • Reduced code errors by 15% through structured code reviews
  • Developed ETL documentation, reducing onboarding time by 30%
  • Resolved data-related issues, improving system uptime by 20%
  • Handled scripting tasks for debugging and automation using Python & Bash

04. Projects

Featured Projects

Cloud

Two-Tier Cloud Automation

Built a two-tier static web application on AWS using Terraform & Ansible. Includes VM provisioning, webserver deployment, and automated security scanning via GitHub Actions.

50% faster deployment
TerraformAnsibleAWSGitHub Actions
DevOps

Containerized App on Kubernetes

Deployed a web app on Amazon EKS with Docker containerization. Automated Docker image builds through GitHub Actions, publishing to private ECR with persistent data integration.

Production-ready K8s
KubernetesDockerAWS EKSECRGitHub Actions
Data

ETL Pipeline in AWS

Developed a fully automated ETL pipeline using Python, AWS S3 and Glue. Implemented data quality checks and monitoring with CloudWatch. Analyzed large datasets using PySpark.

Large-scale analytics
PythonAWS S3AWS GluePySparkCloudWatch
Data

ETL Pipeline for Financial Data

Built an ETL pipeline using IBM DataStage to process financial transaction data. Integrated data from multiple sources into Hadoop for large-scale analysis with quality checks.

Financial data at scale
IBM DataStageHadoopSQLPython

05. Contact

Let's Connect

I'm actively looking for Cloud Engineer & Data Engineering roles. Feel free to reach out for opportunities or collaborations!