FindMyGuru - A Trusted Tutor & Institute Discovery Platform

FindMyGuru A Trusted Tutor & Institute Discovery Platform

Qualification:M.Sc.
Language:English, Telugu
Experience:3 years

★★★★★4/5

Jyotin Padhi

Online

Skills :

Big Data Analytics Pyspark

About :

Jyotin Padhi is a dedicated data engineering and analytics tutor with 3 years of hands-on experience in PySpark and Big Data Analytics . With strong academic qualifications includi…

Jyotin Padhi

Online

Qualification:M.Sc.
Language:English, Telugu
Experience:3 years

Skills :

Big Data Analytics Pyspark

★★★★★ 4/5

Jyotin Padhi is a dedicated data engineering and analytics tutor with 3 years of hands-on experience in PySpark and Big Data Analytics . With strong academic qualifications includi...

FindMyGuru is a tutor discovery platform that helps students find and connect with experienced tutors and institutes across a wide range of subjects and skills. Students can explore tutor profiles, compare expertise, and contact tutors directly for online or in-person learning.FindMyGuru facilitates discovery and connections between students and tutors or institutes. All classes and learning arrangements are handled directly between students and the respective tutors or institutes

Courses by Jyotin Padhi

Course Mode:

Online

Duration:

4 Hour

Language:

English, Telugu

Location:

Rajajinagar, Bhubaneswar, Rayagada

Pricing:

398 INR

Batch Type:

Weekend

Course Content

1️⃣ Introduction to Big Data & PySpark

What is Big Data?
Hadoop ecosystem overview
Spark vs Hadoop MapReduce
Installation & environment setup
Introduction to PySpark architecture

2️⃣ PySpark Core Concepts

RDDs (Resilient Distributed Datasets)
Transformations & actions
Lazy evaluation
RDD persistence & optimization

3️⃣ PySpark DataFrames & SQL

DataFrame creation & operations
Schema definition
Importing CSV, JSON, Parquet, ORC
Spark SQL basics
SQL queries on large datasets
Window functions

4️⃣ Data Processing & ETL with PySpark

Data cleaning
Handling nulls & duplicates
Joins & aggregations
User-defined functions (UDFs)
File formats & partitioning
ETL pipelines with PySpark

5️⃣ Big Data Analytics with PySpark

Exploratory data analysis
Distributed computing principles
Performance optimization techniques
Caching & checkpointing
Cluster management basics

6️⃣ PySpark MLlib (Basics)

Basic ML algorithms with PySpark
Feature engineering in Spark
Pipelines & model evaluation

7️⃣ Real-Time & Batch Processing (Optional Module)

Introduction to Spark Streaming
Structured streaming concepts
Batch processing workflows

8️⃣ Hands-on Projects

ETL pipeline for large datasets
Analytics dashboard-ready dataset creation
Big Data business case implementation

Course Mode:

Online

Duration:

4 Hour

Language:

English, Telugu

Location:

Rajajinagar, Bhubaneswar, Rayagada

Pricing:

398 INR

Batch Type:

Weekend

Overall Student Ratings

4.0

★★★★★

Based on 4 ratings

5 star

25%

4 star

50%

3 star

25%

2 star

1 star

Get tutor location

Location: Rayagada, Rajajinagar, Bhubaneswar

Locate on Google map

Jyotin Padhi

Jyotin Padhi

Courses by Jyotin Padhi

Course Content

1️⃣ Introduction to Big Data & PySpark

2️⃣ PySpark Core Concepts

3️⃣ PySpark DataFrames & SQL

4️⃣ Data Processing & ETL with PySpark

5️⃣ Big Data Analytics with PySpark

6️⃣ PySpark MLlib (Basics)

7️⃣ Real-Time & Batch Processing (Optional Module)

8️⃣ Hands-on Projects

Get tutor location

Similar tutors with Skills

Similar tutors with locations

Start Your Teaching Journey Today

Start Your Teaching Journey Today

Start Your Teaching Journey Today