Conference Deadlines:
RIAO (December 1 deadline - Pittsburgh)
SIGIR 2007 (January 28 deadline - Amsterdam)
WWW 2007 Conference (November 20 deadline; poster deadline TBA - Banff)
Goal: To study current papers from journals and conference proceedings in text retrieval and text mining. Example problems include novelty detection, web retrieval and web mining, ranking strategies, ambiguity resolution, knowledge discovery, web phenomena including social networks, information extraction, and text classification. The reading group is led by Professor Padmini Srinivasan. Interested students (from beginning to advanced) and faculty are invited to participate. The format is informal, with individuals taking turns to present an overview of the selected paper and lead the discussion. This forum has resulted in collaborative projects and published papers.
Special Focus: We will continue to read papers from different proceedings and journals. Additionally, this semester we will take a close look at some of the TREC tracks. (TREC is an international forum for testing algorithms and models on well-defined problems.) Participants are encouraged to suggest readings aligned with their interests.
Note: If you would like to attend the reading group sessions but have a timing conflict, please let me know.
Ellen M. Voorhees. Overview of TREC 2005.
1. Craswell N., de Vries A.P., Soboroff I. Overview of the TREC 2005 Enterprise Track.
2. Macdonald C, He B, Plachouras V, Ounis I. University of Glasgow at TREC 2005: Experiments in Terabyte and Enterprise Tracks with Terrier.
3. Fu Y, Yu W, Li Y, Liu Y, Zhang M. (Tsinghua University, State Key Lab). THUIR at TREC 2005: Enterprise Track.
Cao Y., Liu J., Bao S., and Li H. Research on Expert Search at Enterprise Track of TREC 2005.
Lin J., Abels E., et al. A Menagerie of Tracks at Maryland: HARD, Enterprise, QA, and Genomics, Oh My! (Focus on section 3, the Enterprise track).
Zhang Y, Zincir-Heywood N., and Milios E. Narrative Text Classification for Automatic Key Phrase Extraction in Web Document Corpora. 7th ACM International Workshop on Web Information and Data Management (WIDM), CIKM 2005.
Balog K., Azzopardi L., de Rijke M. Formal Models for Expert Finding in Enterprise Corpora. SIGIR 2006. (Do a Google search on the title.)
1. Lucene
2. Hema Raghavan, James Allan, Andrew McCallum, An Exploration of Entity Models, Collective Classification and Relation Description, Proceedings of the Second International Workshop on Link Analysis and Group Detection, LinkKDD2004, August 22, 2004 in conjunction with the tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Seattle, WA, USA, pp 1-10.
Lucene (Brian Almquist) and a discussion of Netflix