Coaching Course: 0 to 1: Spark for Data Science with Python
What you'll learn
- Check and install Spark dependencies effectively.
- Define PySpark and check the PySpark package by running a program.
- Define transformations and actions to effectively extract information and retrieve results.
- Create a base RDD and perform a count() action to count the records in a dataset independently.
- Introduce RDD partitions and define the functions and customization features of RDD partitions.
- Check and apply partitions within RDD independently.
- Explore the parallel application of map() and reduce() operations.
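
The ideas in the list above (partitions, a map() transformation, and reduce() and count() actions) can be sketched in plain Python. This is only an illustrative analogy under stated assumptions, not PySpark itself and not the course's own material: the helper names split_into_partitions, map_partitions, reduce_partitions and count are hypothetical, and everything runs in a single process. In PySpark the corresponding calls would be sc.parallelize(data, numSlices), rdd.map(...), rdd.reduce(...) and rdd.count().

```python
from functools import reduce
from operator import add


def split_into_partitions(data, num_partitions):
    """Split a list into contiguous chunks (hypothetical stand-in for RDD partitions)."""
    size, extra = divmod(len(data), num_partitions)
    partitions, start = [], 0
    for i in range(num_partitions):
        # The first `extra` partitions receive one additional element.
        end = start + size + (1 if i < extra else 0)
        partitions.append(data[start:end])
        start = end
    return partitions


def map_partitions(partitions, fn):
    """Transformation: apply fn to every element, one partition at a time."""
    return [[fn(x) for x in part] for part in partitions]


def reduce_partitions(partitions, op):
    """Action: reduce each non-empty partition, then combine the partial results."""
    partials = [reduce(op, part) for part in partitions if part]
    return reduce(op, partials)


def count(partitions):
    """Action: total number of elements across all partitions."""
    return sum(len(part) for part in partitions)


numbers = list(range(1, 11))                    # the "base" dataset
partitions = split_into_partitions(numbers, 3)  # three partitions
squares = map_partitions(partitions, lambda x: x * x)
total = reduce_partitions(squares, add)

print(count(partitions))  # number of elements in the dataset
print(total)              # sum of the squared values
```

Because each partition is mapped and reduced on its own before the partial results are combined, the operation passed to the reduce step should be associative and commutative, as addition is here. Spark's reduce() places the same requirement on the function it is given.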
Course content
- Preview (03:18)
- Preview (02:17)
- Preview (06:33)
- Learning Style 2 (10:34)
- Learning Style 3 (08:19)
Requirements
- Assumed knowledge of Python.
- Knowledge of writing Python code directly in the PySpark shell.
- Knowledge of Java and an IDE that supports Maven, such as IntelliJ IDEA or Eclipse, would be helpful.
- Optional knowledge of Hadoop.
Description
This coaching course is approved by SkillsFuture Singapore. It combines online learning with face-to-face skills coaching sessions conducted in small groups in Singapore. Singaporeans can purchase this coaching course with their SkillsFuture Credit. Other learners in Singapore (without SkillsFuture Credit) may also purchase this course on their own account.
From 0 to 1: Spark for Data Science with Python is a course on using Spark and Python to work with a variety of datasets. Learners will gain experience and skills in analyzing data, and will learn and use the relevant techniques, technologies and tools. These include machine learning and data science with Spark's core functionality and built-in libraries, such as RDDs, DataFrames, Spark SQL, MLlib, Spark Streaming and GraphX, along with algorithms such as PageRank and MapReduce, and work with graph datasets.
Who this course is for:
- Young and new entrants to the workforce
- Supervisors looking to upgrade to managerial roles
- Junior Managers looking to upgrade to middle managerial roles
- Middle managers looking to upgrade to senior managerial roles
- Owners of Small & Medium Size Enterprises
Instructors
DioPACT is focused on using technology as an enabler to make learning easy, engaging and effective. Premised on innovative designs, pedagogy and research, we provide quality learning experiences for learners globally. DioPACT offers bespoke solutions for organisations to integrate learning, training and assessment of work-based competencies via blended learning strategies. Specifically, we combine the strengths of Classroom-Facilitated Learning and Massive Open Online Courses (MOOCs), in partnership with UDEMY Inc, to achieve learning outcomes. We also produce chatbots that deliver nuggetised learning.
Dioworks is an e-learning design company focused on using technology as an enabler to make learning easy, engaging and effective. Premised on innovative designs, pedagogy and solid research, we provide quality learning experiences for learners globally. Dioworks offers bespoke solutions for organisations to integrate learning, training and assessment of work-based competencies via blended learning strategies. Specifically, we combine the strengths of Classroom-Facilitated Learning, Massive Open Online Courses (MOOCs) in partnership with UDEMY Inc, and our "Kinetic Coach" automated response training solution to achieve learning outcomes.