[an error occurred while processing this directive] [an error occurred while processing this directive]
[an error occurred while processing this directive]
[an error occurred while processing this directive]
Monash University

FIT5142 Advanced data mining - Semester 2, 2015

Advanced methods of discovering patterns in large-scale multi-dimensional databases are discussed. Solving classification, clustering, association rules analysis and regression problems on different kinds of data are covered. Data pre-processing methods for dealing with noisy and missing data in the context of Big Data are reviewed. Evaluation and analysis of data mining models are emphasised. Hands-on case studies in building data mining models are performed using popular modern software packages.

Mode of Delivery

Caulfield (Day)

Workload Requirements

Minimum total expected workload equals 12 hours per week comprising:

(a.) Contact hours for on-campus students:

  • Two hours of lectures
  • One 2-hour laboratory

(b.) Additional requirements (all students):

  • A minimum of 8 hours independent study per week for completing lab and project work, private study and revision.

See also Unit timetable information

Unit Relationships

Prerequisites

FIT5047 or FIT5045 or equivalent
Sound fundamental knowledge in maths and statistics; database and computer programming knowledge.

Chief Examiner

Campus Lecturer

Caulfield

Grace Rumantir

Consultation hours: Monday 4pm-5pm, Thursday 1-2pm or by appointment

Tutors

Caulfield

Ashish Singh

Consultation hours: TBA

Your feedback to Us

Monash is committed to excellence in education and regularly seeks feedback from students, employers and staff. One of the key formal ways students have to provide feedback is through the Student Evaluation of Teaching and Units (SETU) survey. The University’s student evaluation policy requires that every unit is evaluated each year. Students are strongly encouraged to complete the surveys. The feedback is anonymous and provides the Faculty with evidence of aspects that students are satisfied and areas for improvement.

For more information on Monash’s educational strategy, see:

www.monash.edu.au/about/monash-directions/ and on student evaluations, see: www.policy.monash.edu/policy-bank/academic/education/quality/student-evaluation-policy.html

Previous Student Evaluations of this Unit

Based on previous student feedback this unit is well structured and no major changes have been made for this semester. Further refinements of the materials on the topics covered will be made on a weekly basis and made available on Moodle.

If you wish to view how previous students rated this unit, please go to
https://emuapps.monash.edu.au/unitevaluations/index.jsp

Academic Overview

Learning Outcomes

On successful completion of this unit, students should be able to:
  • explain the kinds of data from which knowledge can be mined, the way each data type can be presented to a data mining algorithm, the kinds of patterns that can be mined from each data type;
  • evaluate the quality of data mining models;
  • perform pre-processing of large-scale multi-dimensional datasets in preparation for data mining experiments;
  • perform data pre-processing for data with outliers, incomplete and noisy data;
  • compare the various learning algorithms and the ability to effectively apply suitable algorithms to mine frequent patterns and associations from data, to perform data classification, data clustering and regression analysis;
  • use modern data mining tools to solve non-trivial data mining problems;
  • research the current trends in data mining applications;
  • work in a team to extract knowledge from a common dataset using various data mining methods and techniques.

Unit Schedule

Week Activities Assessment
0   No formal assessment or activities are undertaken in week 0
1 Introduction There is a self-assessed test (not marked) on basic maths and statistics and the fundamentals of Data Mining on Moodle that will be discussed in the Week 1 tutorial. Please complete this to see if you need to do further study prior to completing this unit.
2 Data Preprocessing  
3 Data Warehousing and Data Mining  
4 Classification and Prediction  
5 Cluster Analysis  
6 Mining Stream, Time-Series and Sequential Data  
7 Graph Mining, Social Network Analysis and Multirelational Data Mining  
8 Unit Test (during the lecture timeslot, tutorials are still on) Unit Test during Week 8 lecture (Monday 14 September 2015)
9 Ensemble Methods in Data Mining Assignment Stage 1 due online Monday 21 September 2015 at 11am
10 Mining Object, Spatial, Multimedia, Text and Web Data (Part 1)  
11 Mining Object, Spatial, Multimedia, Text and Web Data (Part 2) Assignment Stage 2 due on online Monday 12 October 2015 at 11am
12 Application & Trends in Data Mining and Revision  
  SWOT VAC No formal assessment is undertaken in SWOT VAC
  Examination period LINK to Assessment Policy: http://policy.monash.edu.au/policy-bank/
academic/education/assessment/
assessment-in-coursework-policy.html

*Unit Schedule details will be maintained and communicated to you via your learning system.

Teaching Approach

Lecture and tutorials or problem classes
This teaching and learning approach helps students to initially encounter information at lectures, discuss and explore the information during tutorials, and practice in a hands-on lab environment.

Assessment Summary

Examination (3 hours): 60%; In-semester assessment: 40%

Assessment Task Value Due Date
Unit Test 17% Unit Test during Week 8 lecture (Monday 14 September 2015)
Report on Implementations of Advanced Data Mining Techniques 17% Assignment Stage 1 due online Monday 21 September 2015. Assignment Stage 2 due online Monday 12 October 2015)
9 Weekly Revision Quizzes 3% Each weekly quiz will open at the end of the last tutorial in the week and will close at midnight on the Sunday of the week.
Participation in the clicker sessions in the lecture from Week 2 to Week 12 3% Weekly (during the lectures from Week 2 to Week 12)
Examination 1 60% To be advised

Assessment Requirements

Assessment Policy

Assessment Tasks

Participation

  • Assessment task 1
    Title:
    Unit Test
    Description:
    Learning Outcomes: 1, 2, 3, 4

    Closed-book unit test to be conducted in the lecture time slot in Week 8.
    Weighting:
    17%
    Criteria for assessment:

    Correct answers to questions, and quality of solutions to problems, which demonstrates understanding of the learning materials. Further detail of the format and coverage of the unit test will be made available on Moodle.

    Due date:
    Unit Test during Week 8 lecture (Monday 14 September 2015)
  • Assessment task 2
    Title:
    Report on Implementations of Advanced Data Mining Techniques
    Description:
    Learning Outcomes: 5, 6, 7, 8

    A pair of students work in a group to come up with a report on an approved research topic on advances in Data Mining and the students' implementations of the relevant advanced data mining methods (maximum 20 page long)
    Weighting:
    17%
    Criteria for assessment:

    The report will be assessed on the usual criteria, namely: breadth of literature survey, quality of analysis of literature, topicality, the use of a data set to illustrate the implementations of the relevant advanced data mining methods.

    There are 2 stages of the assignment:

    Stage 1: Write up of the structure of the report and the aspects to be covered in the report (non-assessable)
    Stage 2: Submission (17%).

    Due date:
    Assignment Stage 1 due online Monday 21 September 2015. Assignment Stage 2 due online Monday 12 October 2015)
  • Assessment task 3
    Title:
    9 Weekly Revision Quizzes
    Description:
    Learning Outcomes:1 to 7

    Weekly online revision quizzes will be conducted from Week 2.  Each weekly quiz covers the material covered in the respective week.
    Weighting:
    3%
    Criteria for assessment:

    Each weekly online quiz will have 10 multiple choice questions.  There is no negative mark for wrong answers.  The mark of each quiz will be recorded based on student's submission of his/her quiz attempt prior to the closing date of each quiz.

    Due date:
    Each weekly quiz will open at the end of the last tutorial in the week and will close at midnight on the Sunday of the week.
  • Assessment task 4
    Title:
    Participation in the clicker sessions in the lecture from Week 2 to Week 12
    Description:
    Participation in the clicker sessions will be marked during the lectures
    Weighting:
    3%
    Criteria for assessment:

    Active participation in lecture sessions using the response systems is expected. 3% of the final marks is allocated for this participation. The mark will be allocated based on the percentage of recorded participation in the clicker sessions during lectures.

    Due date:
    Weekly (during the lectures from Week 2 to Week 12)

Examinations

  • Examination 1
    Weighting:
    60%
    Length:
    3 hours
    Type (open/closed book):
    Closed book
    Electronic devices allowed in the exam:
    Scientific Calculators

Learning resources

Monash Library Unit Reading List (if applicable to the unit)
http://readinglists.lib.monash.edu/index.html

Feedback to you

Types of feedback you can expect to receive in this unit are:

  • Informal feedback on progress in labs/tutes
  • Graded assignments with comments
  • Test results and feedback
  • Quiz results

Extensions and penalties

Returning assignments

Assignment submission

It is a University requirement (http://www.policy.monash.edu/policy-bank/academic/education/conduct/student-academic-integrity-managing-plagiarism-collusion-procedures.html) for students to submit an assignment coversheet for each assessment item. Faculty Assignment coversheets can be found at http://www.infotech.monash.edu.au/resources/student/forms/. Please check with your Lecturer on the submission method for your assignment coversheet (e.g. attach a file to the online assignment submission, hand-in a hard copy, or use an electronic submission). Please note that it is your responsibility to retain copies of your assessments.

Online submission

If Electronic Submission has been approved for your unit, please submit your work via the learning system for this unit, which you can access via links in the my.monash portal.

Other Information

Policies

Monash has educational policies, procedures and guidelines, which are designed to ensure that staff and students are aware of the University’s academic standards, and to provide advice on how they might uphold them. You can find Monash’s Education Policies at: www.policy.monash.edu.au/policy-bank/academic/education/index.html

Faculty resources and policies

Important student resources including Faculty policies are located at http://intranet.monash.edu.au/infotech/resources/students/

Graduate Attributes Policy

Student Charter

Student services

Monash University Library

Disability Liaison Unit

Students who have a disability or medical condition are welcome to contact the Disability Liaison Unit to discuss academic support services. Disability Liaison Officers (DLOs) visit all Victorian campuses on a regular basis.

[an error occurred while processing this directive]