Text Retrieval & Text Mining Reading Group

Spring 2005

Fridays 9:45 am to 10:45 am.

Focus: This semester we will read papers from the proceedings of conferences such as WWW 2004, WWW 2005, and ICML. Students are encouraged to suggest readings aligned with their interests.

  1. February 25, 2005: Liu et al. Mining topic-specific concepts and definitions on the web. WWW 2004. (Aditya Sehgal)

  2. March 4, 2005: Gabrilovich, Dumais and Horvitz. Newsjunkie: Providing personalized newsfeeds via analysis of information novelty. WWW 2004. (Xin Ying)

  3. March 11, 2005: Etzioni et al. Web-scale information extraction in KnowItAll. WWW 2004. (Bob Ahrens)

  4. March 18, 2005: Spring Break

  5. March 25, 2005: Srivastava et al. Web usage mining: discovery and applications of usage patterns from web data. ACM SIGKDD proceedings, 2000. (Brian Almquist)

  6. April 1, 2005: Menczer. Evolution of document networks. PNAS 101 (suppl. 1), 5261-5265, April 6, 2004. (Aditya Sehgal)

  7. April 8, 2005: Abney and Light. Hiding a Semantic Hierarchy in a Markov Model. Proceedings of the ACL'99 Workshop on Unsupervised Learning in Natural Language Processing. (Marc Light)

  8. April 8, 2005: Lafferty et al. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data. Proc. 18th International Conf. on Machine Learning, 282-289, 2001. (Marc Light)

  9. April 15, 2005: (continued from April 8) Lafferty et al. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data.

 10. April 22, 2005: Koike et al. Automatic extraction of gene/protein functions from biological texts. Bioinformatics 21(7), 2005. (Xin Ying)

 11. May 6, 2005: Yom-Tov et al. Improving document retrieval according to prediction of query difficulty. TREC 2004. (Aditya Sehgal with assistance from Padmini Srinivasan)