Skip to main content
Happy Holidays!

Our offices are closed Dec. 21 – Jan. 1 for winter break. We look forward to seeing you in the New Year!

man and woman staring off into distance with color gradient

Big Data, Introduction | DBDA.X401


In the new paradigm of Big Data where we trust distributed systems to process information across server clusters, we increasingly rely on technologies to manage the massive amounts of information generated by social media, online transactions, web logs, and sensors. These technologies include handling unstructured, semi-structured, and structured data, as well as processing, real-time analytics, and visualization. They are especially useful for reporting in circumstances where a relational database approach is not effective or is too costly.

In this comprehensive introductory course for managers, analysts, architects and developers, you will gain insights into cloud-based Big Data architectures. We will cover Hadoop, Spark and other Big Data platforms based on SQL, such as Hive.

This course includes an overview of the Big Data technologies and frameworks such as HDFS, MapReduce, Spark, Kafka and Hive. The final project will give the ability to design the Big Data Pipeline with the understanding of all acquired knowledge of Big Data Technologies.


Learning Outcomes
At the conclusion of the course, you should be able to

  • Describe big data concepts, characteristics, data management and warehouse
  • Explain the significance of big data and industry use case references
  • Compare and contrast NoSQL with Hadoop, leverage Hadoop ecosystem for analyzing big data and use Hive/NoSQL for data analysis

Topics Include

  • Evolution of Big Data
  • Big Data use cases
  • Big Data applications architecture
  • Understanding Hadoop distributed file system (HDFS)
  • How MapReduce framework works
  • Introduction to HBase (Hadoop NoSQL database)
  • Introduction to Apache Kafka
  • Introduction to Spark and SparkSQL
  • Developing Spark/SparkSQL applications
  • Managing tables and query development in Hive
  • Introduction to data pipelines

Skills Needed:

Moderate level of programming knowledge in Python and SQL

Next Section Starts In:


Days
:
Hours
:
Mins
:
Secs

Jan. 10, 2025, 6:30 p.m.
2025-01-10T18:30:00-08:00
Have a question about this course?
Speak to a student services representative.
Call (408) 861-3860
FAQ
ENROLL EARLY!
This course is related to the following programs:

Sections Open for Enrollment:

Open Sections and Schedule
Start / End Date Quarter Units Cost Instructor
01-10-2025 to 03-14-2025 3.0 $910

Venkat R Mavram

Enroll

Final Date To Enroll: 01-10-2025

Schedule

Date: Start Time: End Time: Meeting Type: Location:
Fri, 01-10-2025 6:30 p.m. 9:30 p.m. Flexible SANTA CLARA / REMOTE
Fri, 01-17-2025 6:30 p.m. 9:30 p.m. Flexible SANTA CLARA / REMOTE
Fri, 01-24-2025 6:30 p.m. 9:30 p.m. Flexible SANTA CLARA / REMOTE
Fri, 01-31-2025 6:30 p.m. 9:30 p.m. Flexible SANTA CLARA / REMOTE
Fri, 02-07-2025 6:30 p.m. 9:30 p.m. Flexible SANTA CLARA / REMOTE
Fri, 02-14-2025 6:30 p.m. 9:30 p.m. Flexible SANTA CLARA / REMOTE
Fri, 02-21-2025 6:30 p.m. 9:30 p.m. Flexible SANTA CLARA / REMOTE
Fri, 02-28-2025 6:30 p.m. 9:30 p.m. Flexible SANTA CLARA / REMOTE
Fri, 03-07-2025 6:30 p.m. 9:30 p.m. Flexible SANTA CLARA / REMOTE
Fri, 03-14-2025 6:30 p.m. 9:30 p.m. Flexible SANTA CLARA / REMOTE
Open Sections and Schedule
Start / End Date Quarter Units Cost Instructor
04-05-2025 to 06-14-2025 3.0 $910

Satyen Kansara

Enroll

Final Date To Enroll: 04-05-2025

Schedule

Date: Start Time: End Time: Meeting Type: Location:
Sat, 04-05-2025 9:00 a.m. 12:00 p.m. Flexible SANTA CLARA / REMOTE
Sat, 04-12-2025 9:00 a.m. 12:00 p.m. Flexible SANTA CLARA / REMOTE
Sat, 04-19-2025 9:00 a.m. 12:00 p.m. Flexible SANTA CLARA / REMOTE
Sat, 04-26-2025 9:00 a.m. 12:00 p.m. Flexible SANTA CLARA / REMOTE
Sat, 05-03-2025 9:00 a.m. 12:00 p.m. Flexible SANTA CLARA / REMOTE
Sat, 05-10-2025 9:00 a.m. 12:00 p.m. Flexible SANTA CLARA / REMOTE
Sat, 05-17-2025 9:00 a.m. 12:00 p.m. Flexible SANTA CLARA / REMOTE
Sat, 05-31-2025 9:00 a.m. 12:00 p.m. Flexible SANTA CLARA / REMOTE
Sat, 06-07-2025 9:00 a.m. 12:00 p.m. Flexible SANTA CLARA / REMOTE
Sat, 06-14-2025 9:00 a.m. 12:00 p.m. Flexible SANTA CLARA / REMOTE