
Semantic Web Meetup at the New York Times

  • Dec 4, 2008 · 6:30 PM

The New York Times Company invites you, the New York Semantic Web Meetup, to join Evan Sandhaus, Semantic Technologist, at The New York Times Building. Evan will present his work on The New York Times Annotated Corpus.

The New York Times Annotated Corpus is a collection of over 1.8 million articles annotated with rich metadata published by The New York Times between January 1, 1987 and July 19, 2007. With over 650,000 individually written summaries and 1.5 million manually tagged articles, it is my hope that The New York Times Annotated Corpus will prove to be a valuable resource for a number of natural language processing research areas, including document summarization, document categorization and automatic content extraction.

The New York Times is releasing the data in conjunction with the Linguistic Data Consortium. Details for obtaining the corpus can be found on their website at:


The New York Times


Marco Neumann, KONA

Introduction to R&D at the New York Times
Gregg Fenton - Director of Emerging Platforms at The New York Times

The New York Times Corpus
Evan Sandhaus - Semantic Technologist
Kristi Reilly - Information Architect

The use of Microformats in Production at the New York Times
Andrei Scheinkman - Software Engineer


  • Morton S.

    Wow. What a venue! What a lot of really useful stuff coming from the New York Times! I'm looking forward to the day they launch the microformats technology!

    December 8, 2008

  • Wlodek Z.

    I enjoyed the talks, wished though for more technical details about the actual content of the disks. Had a couple of very interesting discussions after.

    December 5, 2008

  • Gary L. K.

    The NYT Corpus is really interesting; it's great that it is being released to the community. The presentation by the NYT staff was terrific; it is clear how very excited they are about the possibilities their data opens up.

    December 5, 2008

  • A former member

    Great presentation by Evan Sandhaus and the NYT team. Looking forward to using this NYT/LDC corpus.

    December 4, 2008

  • Maria C. F.

    Just the right size group to truly get a sense of who the other participants are.
    Well structured presentation, interesting topic.

    December 4, 2008

  • A former member

    Very interesting...more semantics than web, but very informative.

    December 4, 2008

60 went
