addressalign-toparrow-leftarrow-rightbackbellblockcalendarcameraccwcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-checkcircle-with-crosscircle-with-pluscrossdots-three-verticaleditemptyheartexporteye-with-lineeyefacebookfolderfullheartglobegmailgooglegroupshelp-with-circleimageimagesinstagramlinklocation-pinm-swarmSearchmailmessagesminusmoremuplabelShape 3 + Rectangle 1ShapeoutlookpersonJoin Group on CardStartprice-ribbonShapeShapeShapeShapeImported LayersImported LayersImported Layersshieldstartickettrashtriangle-downtriangle-uptwitteruserwarningyahoo

Natural Language Pythoning

  • Apr 28, 2016 · 7:00 PM
  • This location is shown only to members

Join NYC Python for a night of talks about loosely and dynamically typed languages with no discernible syntax rules!


Slicing Up Words with Python - Steven Butler

Morphological segmentation is the process of splitting words up in their component morphemes (smallest grammatical units in a language*) in order to improve a variety of downstream NLP tasks, like machine translation and keyword search. For well-resourced languages with well-understood morphological patterns (like English), this is no longer very much of an issue, but for low-resource languages (those that don't have a lot of NLP research done on them), this problem is a lot harder. I'll demonstrate a Python library called Morfessor (an interface to a family of unsupervised segmentation algorithms developed by a research team in Finland) that has been used for working on low-resource language segmentation, and talk a little bit about my work, which is trying to improve segmentation accuracy when morphological patterns get exotic**.

* e.g., 'words' -> ['word', 's']
** infixation, partial reduplication, etc.

Jargon-term Extraction by Chunking - Angus Grieve-Smith, Adam Meyers

One major subfield of NLP, Information Extraction (IE), deals with finding needles in haystacks - taking a huge corpus of text and pulling out the most important parts, like frequently used entities (people, places or things) and the relations between them.  At NYU we developed methods for extracting jargon terms - terms that are used more frequently in particular fields than in general writing.  These methods involved a combination of close annotation of the documents, algorithms to find particular patterns, and machine learning to extend the annotation.

N-grams: What they are and what you can do with them - Max Schwartz

I will touch on text classification, text generation, and how N-grams can fit in to solutions to problems like word segmentation and part-of-speech tagging.

The Ministry of Silly Talks

About our Sponsor:

Based in New York City, Vettery is a trusted hiring marketplace built to help tech candidates find their dream job. They take the best from the recruiting and technology worlds by connecting employers and talent through a curated marketplace of vetted candidates. Vettery currently works with over 1,000 companies including Uber, Jet, Etsy, Flatiron Health, Digital Ocean, SeatGeek, and Casper. For more information on being featured, please apply on

Join or login to comment.

  • Calculus

    Very sorry disturbing. I want a free link learning python from beginning. PDF or any. I want to become professional programmer and lots of people suggested me start with Python. I can't find any good source for learning in Afghanistan.

    July 3

  • Steven B.

    Here are the files from my talk:

    3 · April 29

  • Brien B.

    Awesome talks on very interesting subjects

    1 · April 29

  • Max S.

    My slides and ipython file can be found here:
    Thanks again to the hosts.

    2 · April 29

  • Robert O.

    Slide decks please

    3 · April 25

  • Kenneth F.

    Sorry for the late RSVP drop. Not getting out of work anytime soon due to some bug issues.

    April 28

  • @aaronchall

    Be sure to release your RSVP or get off the waitlist if you can't make it so that people at the top of the waitlist can get in. Note that we're paying closer attention to no-shows. Thanks, everyone!

    April 28

  • Dwight B.

    Just dropped out at 13:10. 1 spot now available.

    April 28

  • David G.

    Really disappointed I can't attend but something else can up. Hope whoever I let in off the wait list has a great time!

    1 · April 28

  • Sreenadh P.

    Would this event be webcast?

    April 27

    • Jerry M.

      There will not be a webcast.

      April 27

  • Maxwell R.

    188 waiting. :-|

    I'll have to catch the next one.

    April 25

  • Noel

    hi..sorry.. cant make it.. can we have the presentation(ppt or video) share latter.

    Thanks in advance!

    April 24

  • Md I.

    Currently on the waitlist with 77 people ahead of me...

    1 · April 21

  • Adrian C.

    Nooo.... can we do this another night?! I can't make this date. Well, enjoy anyhow. Cheers all! 8)

    1 · April 21

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy