[an error occurred while processing this directive]
[an error occurred while processing this directive]In the modern corporate world, data is viewed not only as a necessity for day-to-day operation, it is seen as a critical asset for decision making. However, raw data is of low value. Succinct generalisations are required before data gains high value. Data mining produces knowledge from data, making feasible sophisticated data-driven decision making. This unit will provide students with an understanding of the major components of the data mining process, the various methods and operations for data mining, knowledge of the applications and technical aspects of data mining, and an understanding of the major research issues in this area.
Minimum total expected workload equals 12 hours per week comprising:
(a.) Contact hours for on-campus students:
(b.) Study schedule for off-campus students:
(c.) Additional requirements (all students):
See also Unit timetable information
CSE3212, GCO3828
FIT1004 or FIT2010 or equivalent
Neil Manson
Monash is committed to excellence in education and regularly seeks feedback from students, employers and staff. One of the key formal ways students have to provide feedback is through the Student Evaluation of Teaching and Units (SETU) survey. The University’s student evaluation policy requires that every unit is evaluated each year. Students are strongly encouraged to complete the surveys. The feedback is anonymous and provides the Faculty with evidence of aspects that students are satisfied and areas for improvement.
For more information on Monash’s educational strategy, see:
www.monash.edu.au/about/monash-directions/ and on student evaluations, see: www.policy.monash.edu/policy-bank/academic/education/quality/student-evaluation-policy.html
Based on previous student feedback on the topics on cluster analysis and anomaly detection, additional readings on their application were added to broaden students' knowledge in these areas.
If you wish to view how previous students rated this unit, please go to
https://emuapps.monash.edu.au/unitevaluations/index.jsp
Week | Activities | Assessment |
---|---|---|
0 | No formal assessment or activities are undertaken in week 0 | |
1 | The Need for Data Mining | Practical work and Review Questions |
2 | Model Building | Practical work and Review Questions |
3 | Model Representation | Practical work and Review Questions |
4 | Data Mining Process | Review Questions |
5 | Performance Evaluation | Review Questions |
6 | Engineering the input and output | Practical work and Review Questions; Assignment 1 due 13 April 2015 |
7 | Algorithms | Practical work and Review Questions |
8 | Implementation Issues | Practical work and Review Questions |
9 | Market basket analysis | Practical work and Review Questions |
10 | Cluster Analysis | Practical work and Review Questions; Assignment 2 due 11 May 2015 |
11 | Anomaly Detection | Review Questions |
12 | Case Studies and Data Mining Applications | Review Questions |
SWOT VAC | No formal assessment is undertaken in SWOT VAC | |
Examination period | LINK to Assessment Policy: http://policy.monash.edu.au/policy-bank/ academic/education/assessment/ assessment-in-coursework-policy.html |
*Unit Schedule details will be maintained and communicated to you via your learning system.
Examination (3 hours): 50%; In-semester assessment: 50%
Assessment Task | Value | Due Date |
---|---|---|
Assignment 1 | 15% | 13 April 2015 |
Assignment 2 | 15% | 11 May 2015 |
Weekly participation assessment | 20% | Weekly |
Examination 1 | 50% | To be advised |
Faculty Policy - Unit Assessment Hurdles (http://intranet.monash.edu.au/infotech/resources/staff/edgov/policies/assessment-examinations/assessment-hurdles.html)
Academic Integrity - Please see resources and tutorials at http://www.monash.edu/library/skills/resources/tutorials/academic-integrity/
Students are assessed on their participations in workshops and online activities in Moodle throughout the semester. This includes presentations and weekly participation in the discussion during the workshops.
This assessment will be made weekly and it is worth 20% of the total assessment.
Two assignment tasks are required to be completed by students in pairs. These assignments are worth 30% of the total assessment.
Members of a group are expected to contribute equally to the group work. Each member is to submit a peer review form independently to provide their assessment of every member's contribution. Normally the marks awarded to group members will be the same, however if the peer review indicates an uneven contribution, the marker will investigate and if necessary apportion marks accordingly.
More detailed criteria will be provided in the sample marksheet on the assignment web page.
Members of a group are expected to contributed equally to the group work. Each member is to submit a peer review form independently to provide their assessment of every member's contribution. Normally the marks awarded to group members will be the same, however if the peer review indicates an uneven contribution, the marker will investigate and if necessary apportion marks accordingly.
More detailed criteria will be provided in the sample marksheet on the assignment web page.
This is an individual assessment and assessed on participation rather than correctness.
1. Kennedy, R.L., Lee, Y. Roy, B.V., Reed, C.D. & Lippman, R.P., Solving Data Mining Problems through Pattern Recognition, Prentice Hall, 1998.
2. Cabena, P., Hadjinian, P., Stadler, R., Verhees, J. & Zanasi, A., Discovering Data Mining: from concept to implementation, Prentice Hall, 1997.
3. Berry, J.A.M. & Linoff, G. Data Mining Techniques for Marketing, Sales, and Customer Support, John Wiley & Sons, 1997.
4. Tan, P-N, Steinbach, M. & Kumar, V. Introduction to Data Mining, Addison Wesley, 2006.
5. Han, J., Kamber & Pei, J. M. Data Mining: Concepts and Techniques, Morgan Kaufmann, Third Edition, 2011.
6. Dunham, M.H., Data Mining: Introductory and Advance Topics, Pearson Education, 2003.
7. Groth, R., Data Mining: Building competitive advantage, Prentice Hall, 2000.
8. Berson,. A., Smith, S. & Thearling, K., Building Data Mining Applications for CRM, McGraw Hill. 2000.
9. Berry, J.A.M. & Linoff, G. Mastering Data Mining: The Art and Science of Customer Relationship Management, John Wiley & Sons, 2000.
10. Mena, J. Data Mining Your Website. Digital Press, 1999.
11. Westphal, C. & Blaxton, T. Data Mining Solutions, John Wiley & Sons, 1998.
12. Quinlan, J.R. C4.5: Program for Machine Learning, Morgan Kaufmann, 1993.
13. Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P. & Uthurusamy, R. Advances in Knowledge Discovery and Data Mining, AAAI Press/MIT Press, 1996.
14. Vasant Dhar. Data Science and Prediction, Communications of the ACM, Vol. 56, No. 12, December 2013, pages 64-73.
Monash Library Unit Reading List (if applicable to the unit)
http://readinglists.lib.monash.edu/index.html
Types of feedback you can expect to receive in this unit are:
Submission must be made by the due date otherwise penalties will be enforced.
You must negotiate any extensions formally with your campus unit leader via the in-semester special consideration process: http://www.monash.edu.au/exams/special-consideration.html
It is a University requirement (http://www.policy.monash.edu/policy-bank/academic/education/conduct/student-academic-integrity-managing-plagiarism-collusion-procedures.html) for students to submit an assignment coversheet for each assessment item. Faculty Assignment coversheets can be found at http://www.infotech.monash.edu.au/resources/student/forms/. Please check with your Lecturer on the submission method for your assignment coversheet (e.g. attach a file to the online assignment submission, hand-in a hard copy, or use an electronic submission). Please note that it is your responsibility to retain copies of your assessments.
All assessments must be submitted electronically via the Moodle site for this unit, and must include a signed coversheet for each assessment item.
Please check with your lecturer before purchasing any Required Resources. Limited copies of prescribed texts are available for you to borrow in the library, and prescribed software is available in student labs.
1. Software Title: WEKA, version 3.6 (http://www.cs.waikato.ac.nz/ml/weka/downloading.html)
2. Magnum OPUS version 4.6 (http://www.giwebb.com/downloads.html)
Limited copies of prescribed texts are available for you to borrow in the library.
Witten, I.H., Frank, E. & Hall, M.A.. (2011). Data Mining: Practical Machine Learning Tools and Techniques. (3rd Edition) Morgan Kaufmann Publishers.
Monash has educational policies, procedures and guidelines, which are designed to ensure that staff and students are aware of the University’s academic standards, and to provide advice on how they might uphold them. You can find Monash’s Education Policies at: www.policy.monash.edu.au/policy-bank/academic/education/index.html
Important student resources including Faculty policies are located at http://intranet.monash.edu.au/infotech/resources/students/
The University provides many different kinds of support services for you. Contact your tutor if you need advice and see the range of services available at http://www.monash.edu.au/students. For Malaysia see http://www.monash.edu.my/Student-services, and for South Africa see http://www.monash.ac.za/current/.
The Monash University Library provides a range of services, resources and programs that enable you to save time and be more effective in your learning and research. Go to www.lib.monash.edu.au or the library tab in my.monash portal for more information. At Malaysia, visit the Library and Learning Commons at http://www.lib.monash.edu.my/. At South Africa visit http://www.lib.monash.ac.za/.