Text Retrieval & Text Mining Reading Group

Fall 2007

Friday 10:30 am to 12:00 pm.

School of Library & Information Science (meeting room number 3082)

3rd floor Main Library (here is a map showing the building location.
The room is on the side closest to Burlington Street)

Previous Years Reading Groups

Conference Deadlines:

to be added.

Goal: To study current papers from journals and conference proceedings in text retrieval and text mining. Examples of problems include topic models, web retrieval and web mining, ranking strategies, ambiguity resolution, knowledge discovery, web phenomenon including social networks, information extraction and text classification. The reading group is lead by Professor Padmini Srinivasan. Interested students (from beginning to advanced students) and faculty are invited to participate in the reading group. Participation format is informal with individuals taking turns to present an overview of the selected paper and lead the discussion. This forum has resulted in collaborative projects and published papers.

Note if you would like to attend the reading group sessions and have a timing conflict please let me know.

  1. September 7, 2007: Organizational meeting.

  2. September 14, 2007:

    Estimation and Use of Uncertainty in Pseudo-Relevance Feedback by K Collins-Thompson, J Callan. ACM SIGIR Proceedings, 2007. (lead Brian Almquist)

  3. September 21, 2007:

    Topic evolution and social interactions: how authors effect research by Zhou et al. CIKM Proceedings, 2006. (lead Viet Ha Thuc).

  4. September 28, 2007:

    What makes a query difficult? by David Carmel et al. SIGIR 2006. (lead Brian Almquist)

  5. October 5, 2007:

    MedScan, a natural language processing engine for MEDLINE abstracts by Svetlana Novichkova, Sergei Egorov and Nikolai Daraselia. Bioinformatics (lead Cuong Bui)

  6. October 12, 2007:

    An Exploration of Proximity Measures in Information Retrieval by Tao, T and Zhai, C. SIGIR 2007. (lead Brian Almquist)

  7. October 19, 2007:

    Continuation with previous week's paper

  8. October 26, 2007:

  9. November 2, 2007:

    Brian Almquist presents his research on TREC legal.

  10. November 16, 2007:

    Recognising Textual Entailment with Logical Inference by Bos and Markert, Proceedings Human Language Technology and Empirical Methods in Natural Language Processing. 2005. (lead Cuong Bui)

  11. November 23, 2007: Thanksgiving Break

  12. November 30, 2007:

    LDA-Based Document Models for Ad-hoc Retrieval by Wei and Croft, SIGIR 2006. (lead Viet Ha Thuc)

  13. December 2, 2007:

    Cumulated Gain-Based Evaluation of IR Techniques by Järvelin andKekäläinen, ACM TOIS 2. (lead Bob Arens)

  14. December 9, 2007:

    Adaptive classifiers, topic drifts and GO annotations by P. Srinivasan, AMIA 2007. (lead Padmini Srinivasan)