CS4480 - Data-Intensive Computing

Offering Academic Unit
Department of Computer Science
Credit Units
Course Duration
One Semester
Course Offering Term*:
Semester A 2018/19
Semester A 2019/20 (Tentative)

* The offering term is subject to change without prior notice
Course Aims

This course is aimed at equipping students with the ability to manage very large data sets (Big Data) using a cluster of commodity machines with the main focus on the Hadoop ecosystem. Specifically, the main objective of this course is twofold: to familiarize students with software systems and techniques for implementing distributed data-parallel programs; to provide an insight into internal mechanisms of large-scale data analytical systems. Students will also have the opportunity to work on a real-world data processing problem by implementing scalable solutions using techniques and software systems covered in this course and to deploy their solutions on a large-scale commodity cluster.

Assessment (Indicative only, please check the detailed course information)

Continuous Assessment: 60%
Examination: 40%
Examination Duration: 2 hours
For a student to pass the course, at least 30% of the maximum mark for the examination must be obtained.
Detailed Course Information


Useful Links

Department of Computer Science