Building Big Data Pipelines with PySpark MongoDB and Bokeh

0 (0 REVIEWS)
7 STUDENTS

GET THIS COURSE AND 2500+ OTHERS FOR ONLY £49 FIND OUT MORE

Overview:

Embark on an enriching journey with our online course, “Building Big Data Pipelines with PySpark, MongoDB, and Bokeh,” designed to equip you with the skills to handle large-scale data processing and visualization efficiently. Through this course, you’ll harness the power of PySpark for distributed computing, MongoDB for data storage, and Bokeh for interactive data visualization. Our hands-on approach ensures you gain practical experience in building robust data pipelines, enabling you to tackle real-world big data challenges effectively.

  • Interactive video lectures by industry experts
  • Instant e-certificate and hard copy dispatch by next working day
  • Fully online, interactive course with Professional voice-over
  • Developed by qualified first aid professionals
  • Self paced learning and laptop, tablet, smartphone friendly
  • 24/7 Learning Assistance
  • Discounts on bulk purchases

Main Course Features:

  • Comprehensive coverage of PySpark fundamentals
  • Hands-on projects to build end-to-end data pipelines
  • Integration of MongoDB for scalable data storage
  • Utilization of Bokeh for interactive and dynamic data visualization
  • Best practices for optimizing big data workflows

Who Should Take This Course:

  • Data engineers aspiring to work with big data technologies
  • Software developers interested in distributed computing and data processing
  • Data analysts seeking to enhance their skills in handling large-scale data sets

Learning Outcomes:

  • Master PySpark for distributed data processing
  • Build scalable data pipelines using PySpark and MongoDB
  • Create interactive visualizations with Bokeh
  • Implement best practices for big data pipeline development
  • Handle large volumes of data efficiently
  • Perform data analysis and transformation at scale
  • Optimize data processing workflows for performance
  • Interpret and present insights derived from big data analyses

Certification

Once you’ve successfully completed your course, you will immediately be sent a digital certificate. Also, you can have your printed certificate delivered by post (shipping cost £3.99). All of our courses are fully accredited, providing you with up-to-date skills and knowledge and helping you to become more competent and effective in your chosen field. Our certifications have no expiry dates, although we do recommend that you renew them every 12 months.

Assessment

At the end of the Course, there will be an online assessment, which you will need to pass to complete the course. Answers are marked instantly and automatically, allowing you to know straight away whether you have passed. If you haven’t, there’s no limit on the number of times you can take the final exam. All this is included in the one-time fee you paid for the course itself.

Course Curriculum

Section 01: Introduction
Introduction 00:10:00
Section 02: Setup and Installations
Python Installation 00:03:00
Installing Third Party Libraries 00:03:00
Installing Apache Spark 00:12:00
Installing Java (Optional) 00:05:00
Testing Apache Spark Installation 00:06:00
Installing MongoDB 00:04:00
Installing NoSQL Booster for MongoDB 00:07:00
Section 03: Data Processing with PySpark and MongoDB
Integrating PySpark with Jupyter Notebook 00:05:00
Data Extraction 00:19:00
Data Transformation 00:15:00
Loading Data into MongoDB 00:13:00
Section 04: Machine Learning with PySpark and MLlib
Data Pre-processing 00:19:00
Building the Predictive Model 00:12:00
Creating the Prediction Dataset 00:08:00
Section 05: Data Visualization
Loading the Data Sources from MongoDB 00:17:00
Creating a Map Plot 00:33:00
Creating a Bar Chart 00:09:00
Creating a Magnitude Plot 00:15:00
Creating a Grid Plot 00:09:00
Section 06: Creating the Data Pipeline Scripts
Installing Visual Studio Code 00:05:00
Creating the PySpark ETL Script 00:24:00
Creating the Machine Learning Script 00:30:00
Creating the Dashboard Server 00:21:00
Building Big Data Pipelines with PySpark MongoDB and Bokeh
ex Vat

SAVE 80% - OFFER ENDS SOON

TAKE THIS COURSE
  • Original price was: £125.00.Current price is: £25.00. ex Vat
  • 1 year
  • Course Badge
  • Course Certificate
  • 5 hours, 4 minutes
  • Gift this course
£25 /Unit Price
Total:
£125.00
Quantity:

Buying more than one of the same courses?

  • 50% discount for orders of 10+ courses
  • 60% discount for orders of 50+ courses
  • 70% discount for orders of 100+ courses

Looking for a more personalised package?

contact us now

Instructors

Profile Photo
A A
7 STUDENTS ENROLLED