Big Data Programming (SE421) Course Detail

Course Name Course Code Season Lecture Hours Application Hours Lab Hours Credit ECTS
Big Data Programming SE421 Area Elective 2 2 0 3 5
Pre-requisite Course(s)
N/A
Course Language English
Course Type Elective Courses
Course Level Natural & Applied Sciences Master's Degree
Mode of Delivery Face To Face
Learning and Teaching Strategies Lecture, Drill and Practice.
Course Coordinator
Course Lecturer(s)
Course Assistants
Course Objectives Upon completing this course, the student will be able to design and implement map-reduce programs for various large data set processing tasks, and will be able to design and implement programs using Apache Spark.
Course Learning Outcomes The students who succeeded in this course;
  • Describe the architecture of Hadoop.
  • Explain the basic operation of HDFS
  • Develop MapReduce applications
  • View HDFS data from a relational perspective using Pig and Hive
  • Describe what Spark is all about know why you would want to use Spark
  • Use Resilient Distributed Datasets (RDD) operations
  • Use Resilient Distributed Datasets (RDD) operations
  • Implement and execute Apache Spark applications.
Course Content What is "Big Data"; the dimensions of Big Data; scaling problems; HDFS and the Hadoop ecosystem; the basics of HDFS, MapReduce and Hadoop cluster; writing MapReduce programs to answer questions about data; MapReduce design patterns; basic Spark architecture; common operations; Use Resilient Distributed Datasets (RDD) operations.

Weekly Subjects and Releated Preparation Studies

Week Subjects Preparation
1 Introduction to Big Data and Hadoop Chapter 1
2 Setting Up a Hadoop Cluster Chapter 9
3 Hadoop Distributed Filesystem (HDFS) Chapter 3
4 Hadoop Distributed Filesystem (HDFS) Chapter 4
5 MapReduce Chapter 2
6 MapReduce Chapter 5
7 MapReduce Chapter 6
8 MapReduce Chapter 7-8
9 Administering Hadoop Chapter 10
10 Pig Chapter 11
11 Hive Chapter 12
12 HBase Chapter 13
13 Spark Programming Other resources 2
14 Spark Programming Other resources 2
15 Final Exam
16 Final Exam

Sources

Course Book 1. Hadoop: The Definitive Guide, Tom White, 3rd. Ed., O'Reilly Media, 2012
Other Sources 2. MapReduce Design Patterns: Building Effective Algorithms and Analytics for Hadoop and Other Systems, Donald Miner, Adam Shook, O'Reilly Media, November 2012
3. Learning Spark: Lightning-Fast Big Data Analysis, Holden Karau, Andy Konwinski, Patrick Wendell, Matei Zaharia, O'Reilly Media, January 2015

Evaluation System

Requirements Number Percentage of Grade
Attendance/Participation - -
Laboratory 5 30
Application - -
Field Work - -
Special Course Internship - -
Quizzes/Studio Critics - -
Homework Assignments - -
Presentation - -
Project - -
Report - -
Seminar - -
Midterms Exams/Midterms Jury 1 30
Final Exam/Final Jury 1 40
Toplam 7 100
Percentage of Semester Work
Percentage of Final Work 100
Total 100

Course Category

Core Courses X
Major Area Courses
Supportive Courses
Media and Managment Skills Courses
Transferable Skill Courses

The Relation Between Course Learning Competencies and Program Qualifications

# Program Qualifications / Competencies Level of Contribution
1 2 3 4 5
1 An ability to apply advanced knowledge of computing and/or informatics to solve software engineering problems.
2 Develop solutions using different technologies, software architectures and life-cycle approaches.
3 An ability to design, implement and evaluate a software system, component, process or program by using modern techniques and engineering tools required for software engineering practices.
4 An ability to gather/acquire, analyze, interpret data and make decisions to understand software requirements.
5 Skills of effective oral and written communication and critical thinking about a wide range of issues arising in the context of working constructively on software projects.
6 An ability to access information in order to follow recent developments in science and technology and to perform scientific research or implement a project in the software engineering domain.
7 An understanding of professional, legal, ethical and social issues and responsibilities related to Software Engineering.
8 Skills in project and risk management, awareness about importance of entrepreneurship, innovation and long-term development, and recognition of international standards of excellence for software engineering practices standards and methodologies.
9 An understanding about the impact of Software Engineering solutions in a global, environmental, societal and legal context while making decisions.
10 Promote the development, adoption and sustained use of standards of excellence for software engineering practices.

ECTS/Workload Table

Activities Number Duration (Hours) Total Workload
Course Hours (Including Exam Week: 16 x Total Hours)
Laboratory 14 2 28
Application
Special Course Internship
Field Work
Study Hours Out of Class
Presentation/Seminar Prepration
Project
Report
Homework Assignments 5 6 30
Quizzes/Studio Critics
Prepration of Midterm Exams/Midterm Jury 1 15 15
Prepration of Final Exams/Final Jury 1 20 20
Total Workload 93