real systemcomponent. Examples would include implementation of a new algorithm, implementation/evaluation of an algorithm from the literature that hasn't yet been implemented/evaluated), or use of data mining to address a problem with real-world data.
Group project proposals are welcome. If you are proposing a group project, the evaulation criteria should specify the different roles of the group members, and how your contributions will be evaluated individually.
I would like a draft sketch (at least the introductory overview
mentioned above) by Friday, March 10. I will provide feedback over
spring break, meet with you if you desire March 20-22, and then you
can provide the final writeup March 28.
Advanced topics. For this, you should provide a prioritized list of topics that you would like to cover in class. For each topic, provide at least one source for studying that topic (the most common source would be a paper from the research literature - please provide a link to electronic copy if possible, although just a citation is okay.) I don't guarantee that we'll cover every topic, and I may choose different papers than you suggest for covering a topic, but this should give me a good idea of your interests so that I can do my best to cover what you find most exciting. Note that this doesn't have to be (but may be) related to your project.
Suggested sources for papers are Data Mining and Knowledge Discovery, Transactions on Knowledge and Data Engineering, Knowledge and Information Systems, SIGKDD Explorations, and the KDD, ICDM, ECML/PKDD, PAKDD, and SIAM Data Mining conferences proceedings. You'll find the latest (and often preliminary) in workshops associated with those conferences, but I'd go with workshop papers only from the last couple of years. You'll also find work in conferences from other areas such as AAAI and SIGMOD, in particular the SIGMOD data mining workshops (probably on a par with some of the conferences above for quality.)
You might also find the list of topics from last year (below) to be a start.
I would like one or two topics by March 10. The rest can wait until March 28. You should expect to present a paper and lead discussion on one of your topics in class on March 23, 28, or 30 - a preferred date would help.
Electronic submission preferred. Please email to . Pdf is the safest for capturing non-text Hard copy is acceptable, please hand in at the beginning of class.