skip to main content
10.3115/1219840.1219877dlproceedingsArticle/Chapter ViewAbstractPublication PagesaclConference Proceedingsconference-collections
Article
Free Access

Digesting virtual "geek" culture: the summarization of technical internet relay chats

Authors Info & Claims
Published:25 June 2005Publication History

ABSTRACT

This paper describes a summarization system for technical chats and emails on the Linux kernel. To reflect the complexity and sophistication of the discussions, they are clustered according to subtopic structure on the sub-message level, and immediate responding pairs are identified through machine learning methods. A resulting summary consists of one or more mini-summaries, each on a subtopic from the discussion.

References

  1. M. S. Ackerman and C. Halverson. 2000. Reexaming organizational memory. Communications of the ACM, 43(1), 59--64. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. A. Berger, S. Della Pietra, and V. Della Pietra. 1996. A maximum entropy approach to natural language processing. Computational Linguistics, 22(1):39--71. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. M. Elliott and W. Scacchi. 2004. Free software development: cooperation and conflict in a virtual organizational culture. S. Koch (ed.), Free/Open Source Software Development, IDEA publishing, 2004.Google ScholarGoogle Scholar
  4. W. B. Frakes and R. Baeza-Yates. 1992. Information retrieval: data structures & algorithms. Prentice Hall. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. M. Galley, K. McKeown, J. Hirschberg, and E. Shriberg. 2004. Identifying agreement and disagreement in conversational speech: use of Bayesian networks to model pragmatic dependencies. In the Proceedings of ACL-04. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. M. A. Hearst. 1994. Multi-paragraph segmentation of expository text. In the Proceedings of ACL 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. T. Joachims. 1998. Text categorization with support vector machines: Learning with many relevant features. In Proceedings of the ECML, pages 137--142. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. D. Lam and S. L. Rohall. 2002. Exploiting e-mail structure to improve summarization. Technical Paper at IBM Watson Research Center #20-02.Google ScholarGoogle Scholar
  9. S. Levinson. 1983. Pragmatics. Cambridge University Press.Google ScholarGoogle Scholar
  10. P. Newman and J. Blitzer. 2002. Summarizing archived discussions: a beginning. In Proceedings of Intelligent User Interfaces. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. O. Rambow, L. Shrestha, J. Chen and C. Laurdisen. 2004. Summarizing email threads. In Proceedings of HLT-NAACL 2004: Short Papers. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. K. Ries. 2001. Segmenting conversations by topic, initiative, and style. In Proceedings of SIGIR Workshop: Information Retrieval Techniques for Speech Applications 2001: 51--66. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. E. A. Schegloff and H. Sacks. 1973. Opening up closings. Semiotica, 7--4:289--327.Google ScholarGoogle Scholar
  14. S. Wan and K. McKeown. 2004. Generating overview summaries of ongoing email thread discussions. In Proceedings of COLING 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. J. H. Ward Jr. and M. E. Hook. 1963. Application of an hierarchical grouping procedure to a problem of grouping profiles. Educational and Psychological Measurement, 23, 69--81.Google ScholarGoogle ScholarCross RefCross Ref
  16. K. Zechner. 2001. Automatic generation of concise summaries of spoken dialogues in unrestricted domains. In Proceedings of SIGIR 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  1. Digesting virtual "geek" culture: the summarization of technical internet relay chats

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image DL Hosted proceedings
      ACL '05: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
      June 2005
      657 pages
      • General Chair:
      • Kevin Knight

      Publisher

      Association for Computational Linguistics

      United States

      Publication History

      • Published: 25 June 2005

      Qualifiers

      • Article

      Acceptance Rates

      ACL '05 Paper Acceptance Rate77of423submissions,18%Overall Acceptance Rate85of443submissions,19%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader