• November: eDiscovery and EMNLP
    Meet us at the MakeOffices in Dupont Circle this month for two great talks. David Grossman (http://ir.cs.georgetown.edu/dagr/), Associate Director of Georgetown University's Information Retrieval Lab, will present FreeDiscovery. FreeDiscovery's initial focus is on eDiscovery for the legal community, and it provides a really nice, easy way to ReST enable any machine learning library. Further it simplifies comparing many algorithms against reference datasets and provides an easy API geared towards developers that don't necessarily need to know much about machine learning. Dan Simonson (https://twitter.com/thedansimonson), PhD candidate in computational linguistics at Georgetown University, will recap the good parts of this year's Empirical Methods in Natural Language Processing conference (http://www.emnlp2016.net/) (EMNLP). He'll also update us on his work investigating document categories with narrative schemas. ************************* We'll gather starting at 6:30pm at MakeOffices - we'll have someone to meet you in the lobby. Introductions & announcements will start around 7:00pm, and presentations will begin at 7:30pm. Afterwards, there will be plenty of time for follow-up questions, networking, and more. DC NLP meets each month to network, socialize, and learn about the interesting work folks are doing in natural language processing, computational linguistics, text analytics, and more. Do you have something you'd like to share with the group? Let us know! We're always looking for speakers to give talks at future meetups, and don't forget to follow @DCNLP (https://twitter.com/DCNLP/) on Twitter!

    MakeOffices - Dupont Circle

    1200 18th Street, NW Suite #700 · Washington, DC

    8 comments
  • NLP Hacks: Joint Meetup with Hack&&Tell
    DC NLP and Hack&&Tell are at the same time again this month - because we're getting together for a joint meetup! Let's try out the Hack&&Tell format and meet more of their great members interested in text processing. This October, we're looking for presenters, speakers, and hackers for anything related to human language, whether text, audio, videos of sign language, etc... show us what you’ve got! (remember: no startup pitches, dull work projects, or deckware). The format is informal, engaging, and friendly: presenters have *5* minutes to show off their hack (it goes faster than you’d expect) and then the audience has 5 minutes to ask questions, make jokes, and suggest ideas and feedback. Presentation signups are below: http://bit.ly/presentatdc Need an idea? Here are some talks we'd like to see at H&&T NLP: + Political twitter bots, bonus points if they pass the Turing test and fight w/real people!+ the weirdest take on a Turing test that you can come up with + Haiku! + Automated assistant hacks, or how I got Cortana to break up with me+ Pen-testing ("real" hacking!) with deep learning NLP + Linguistic analysis of everyone's favorite anti-blog gawker + Voice activation hacks, like getting Alexa to order a pizza when my dog barks or find the nearest polling place+ generating text based on your own email history (can you simulate yourself? your friends?) + unsupervised email classification (separate into personal, work, whatever) + recruit-bot: a bot that corresponds with recruiters for arbitrarily long exchanges (Q-learning on real people?) + multi-scale generative text (character + word??? + more?) + sentiment analysis for your work emails. Maybe don't read some of them right away. + baby names, learned from pictures of other people's babies and their names + generate hack and tell ideas?! Does it have to be related to human language? No, but if the speaking slots fill up, we will try to give priority to NLP this month. See you at the meetup, at WeWork Chinatown. This month we are starting at 6:30 with pizza and the talks will begin at 7:00. DC NLPers, be a little earlier than usual. Big thanks to WeWork for hosting us!

    WeWork 718 7th St NW

    718 7th St NW · Washington

    7 comments
  • August Meetup: two great talks PLUS a giveaway!
    After taking off the month of July, DC NLP is back with a bang in August! We'll be meeting at Comcast Labs DC again this month with two fantastic presentations on Deep Learning for NLP and advanced Search Engines. Also, courtesy of our friends at O'Reilly Media, we'll have a bunch of swag to give away including books, t-shirts, and other assorted goodies. This is sure to be an event you won't want to miss! First up, Ari Chanen (https://www.linkedin.com/in/ari-chanen-phd-3390253), Lead AI Engineer at MITRE (https://twitter.com/MITREcorp), will give a talk about his recently-published paper entitled "Deep Learning for Extracting Word-level Meaning from Safety Report Narratives" which he presented at the aviation-focused 2016 Integrated Communication Navigation and Surveillance (http://i-cns.org/) conference. This work is focused on practical questions surrounding the use of the deep learning technique word2vec. The first novel technique Ari will discuss is a way of picking a reasonable number of dimensions for a word2vec model which is based on the level of disagreement between a set of models trained on the same data. No training data is needed for this technique. The second novel technique is using multiple word2vec models together in an ensemble model to get a more robust set of similar terms over using a single word2vec model. For our second talk, Mark Lidd (https://www.linkedin.com/in/kramdl), Team Lead at DataFission (https://twitter.com/datafission), will give a presentation on DUSE: a content-based search engine for digesting and searching unstructured data (http://www10.giscafe.com/blogs/gissusan/2016/07/29/datafissions-new-duse-search-engine-lets-you-search-any-unstructured-data/). DUSE allows users to search anything digitally, from video, audio, network traffic, satellite images, radar data, malware, and images -- as well as unstructured text -- all in one search. Unlike traditional search engines such as Yahoo, Google, and Microsoft that have been around for 15 years, searching with DUSE doesn't require further work to search the medium the user requires. It contains both simple APIs and GUIs that provide rankings and answers to complex data queries to uniforms and analysts, as well as advanced controls for data scientists, such as access to live data structures and tools at a very deep level. ************************* A brief note from O'Reilly Media: Discover the real-world opportunities of applied artificial intelligence AI is (finally) driving an explosion in intelligent software—bots, agents, voice and IoT interfaces. Learn how to implement AI in real-world projects today and explore what the future holds for intelligence engineering. Save 20% with discount code UGDCNLP. Check out the impressive agenda and speaker lineup. http://www.oreilly.com/pub/cpc/21357 ************************ We'll gather starting at 6:30pm at Comcast Labs (http://dclabs.comcast.com/) - someone will greet you outside and let you into the building. Introductions & announcements will start around 7:00pm, and presentations will begin at 7:30pm. Afterwards, there will be plenty of time for follow-up questions, networking, and more. DC NLP meets each month to network, socialize, and learn about the interesting work folks are doing in natural language processing, computational linguistics, text analytics, and more. Do you have something you'd like to share with the group? Let us know! We're always looking for speakers to give talks at future meetups, and don't forget to follow @DCNLP (https://twitter.com/DCNLP/) on Twitter!

    Comcast Labs

    1110 Vermont Ave, NW · Washington, DC

    11 comments
  • June: the Ghost in the Search Machine
    Continuing our whirlwind tour of the DC area, this month we'll be meeting at WeWork Dupont Circle, right off the red line (location has been updated from Shaw). Come check out an exciting talk about the intersection of NLP and information retrieval in a cool co-working space! Doug Turnbull (https://twitter.com/softwaredoug) is a Search Relevance Consultant at OpenSource Connections (http://opensourceconnections.com/) and author of the Manning book Relevant Search (https://www.manning.com/books/relevant-search) (currently available for pre-order on Amazon (http://www.amazon.com/Relevant-Search-applications-Solr-Elasticsearch/dp/161729277X)). He crafts search & recommendation solutions using Solr/Elasticsearch, sprinkling a little natural language processing and machine learning on top for good measure. Doug's solutions make a big difference for companies like O'Reilly Media and Advance Auto Parts. His secret weapon is enabling non-technical product owners, businesses, and domain experts to take deeper control of their search. Through writing and speaking, Doug wants to humanize search and recommendations, making these topic less intimidating for everyone: developers, non-technical folks, and users alike. Doug loves building tools to help with search relevancy, including Quepid (http://quepid.com/), Splainer (http://splainer.io/) and Elyzer (http://github.com/o19s/elyzer). In this talk, you'll get an overview of search engine internals and learn how to turn Elasticsearch into a smart comparison powerhouse for your organization. Far more than just a full-text search engine, Elasticsearch can be 'taught' to detect at-risk students, predict the weather, and find similar images at high scale. ************************ We'll gather starting at 6:30pm at WeWork Wonder Bread Factory (https://www.wework.com/locations/washington-d-c/wonder-bread-factory). Introductions & announcements will start around 7:00pm, and presentations will begin at 7:30pm. Afterwards, there will be plenty of time for follow-up questions, networking, and more. DC NLP meets each month to network, socialize, and learn about the interesting work folks are doing in natural language processing, computational linguistics, text analytics, and more. Do you have something you'd like to share with the group? Let us know! We're always looking for speakers to give talks at future meetups, and don't forget to follow @DCNLP (https://twitter.com/DCNLP/) on Twitter!

    WeWork Dupont Circle

    1875 Connecticut Avenue NW · Washington, DC

    27 comments
  • May: the Power of Speech
    Another month, another new venue! This time, we're meeting in a space generously provided by Comcast, near the McPherson Sq & Farragut North metro stations. They're also sponsoring food for the event - many thanks to Comcast! We've got two great speakers on the lineup this month, both of whom will be giving presentations on speech-based systems. Vikrant Sagar (https://www.linkedin.com/in/sagarvikrant) completed a Masters in Computer Science at the University ​o​f Florida in 2013, and is now an NLP Engineer at Comcast Labs (http://dclabs.comcast.com/research/) in downtown Washington, DC. Comcast Labs' NLP group is developing applications around the voice-enabled TV remote, which is part of the new X1 Entertainment System. Vikrant's presentation will discuss the kinds of NLP problems being worked on by the team at Comcast Labs. Scott Novotney (https://www.linkedin.com/in/snovotney) has a PhD in Computer Science from Johns Hopkins and works as a Scientist at Raytheon BBN Technologies (http://www.bbn.com/) on multiple DARPA projects. His presentation is entitled Cheap, Fast and Good Enough: Transcribing Speech with Mechanical Turk: Historical estimates of human transcription of conversational speech require at a minimum of 10 hours of effort per hour. These costs quickly multiply when the number of transcribers who have access to sensitive data, such as medical records, becomes limited. In this talk, we engage non-expert transcribers who are given minimal instructions to bootstrap an automatic speech recognition system. With transcription costs of only five dollars per hour of data, the transcribers have an error rate of 25%. Nonetheless, when this data is used to train a statistical model, the gap between on-experts and professionals disappears. We then show how non-experts can be used to rank other non-expert transcribers without requiring any gold standard data. ************************ We'll gather starting at 6:30pm at Comcast Labs (http://dclabs.comcast.com/) - someone will greet you outside and let you into the building. Introductions & announcements will start around 7:00pm, and presentations will begin at 7:30pm. Afterwards, there will be plenty of time for follow-up questions, networking, and more. DC NLP meets each month to network, socialize, and learn about the interesting work folks are doing in natural language processing, computational linguistics, text analytics, and more. Do you have something you'd like to share with the group? Let us know! We're always looking for speakers to give talks at future meetups, and don't forget to follow @DCNLP (https://twitter.com/DCNLP/) on Twitter!

    Comcast Labs

    1110 Vermont Ave, NW · Washington, DC

    7 comments
  • textminR (w/ Statistical Programming DC)
    Join us for a joint event with Statistical Programming DC - we'll learn about Tommy Jones' new open-source R package for the statistical analysis of language. Please RSVP on their meetup page. (http://www.meetup.com/stats-prog-dc/events/230306971/) Abstract: textmineR introduces a framework for natural language processing (NLP) in R that improves upon current NLP frameworks available in R. Specifically, textmineR (https://cran.r-project.org/web/packages/textmineR/index.html) (development version here (https://github.com/TommyJones/textmineR/tree/2.0)) has a syntax that is more intuitive to experienced R users. It uses objects, methods, and functions that behave like regular R dense matrices. This lowers the barrier to beginning statistical analyses of language to statisticians and other data professionals. Bio:Tommy Jones (https://twitter.com/thos_jones) is the director of data science at Impact Research, LLC. He holds an MS in mathematics and statistics from Georgetown University and a BA in economics from the College of William and Mary. He is a PhD student in the George Mason University Department of Computational and Data Sciences. He specializes in statistical models of language and time series modeling. Slightly different format for the joint meetup: 6:30 - 7:00 Food, drinks and meet new people 7:00 - 7:15 Intros and announcements 7:15 - 8:30 Presentation, Q&A 8:30ish Off to the bar

    GWU, Funger Hall, Room 103

    2201 G St. NW · Washington, DC

  • February: NLP at DARPA. NEW LOCATION
    This February, we are trying out a new day (MONDAY) and a new location (The Big Hunt, right off Dupont Circle). We're hoping this works out as a long-term home for us. Join us as we check it out. Our speaker this month is from the Defense Advanced Research Projects Agency (better known as DARPA). Come get his perspective on how DARPA's using NLP today and the direction it will take in the future: DARPA has been a major driver of NLP research in the US for several decades. In this talk, Boyan Onyshkevych, currently a Program Manager at DARPA, will discuss several current NLP-related DARPA programs, and will provide his perspective on future NLP research directions. Dr. Boyan Onyshkevych is a Program Manager at DARPA's Information Innovation Office. His research interests include human language technologies and knowledge-based systems applied to the areas of information extraction, language understanding and semantic computing. ************************ We'll meet at 6:30pm at the Big Hunt, right of Dupont Circle. We'll be in the Jungle Room, the bar to the left. We'll do intros around 7:00 and move into the talk right around 7:30. DC NLP meets in the second week of each month to network, socialize, and learn about the interesting work folks are doing in natural language processing, computational linguistics, text analytics, and more. Do you have something you'd like to share with the group? Let us know! We're always looking for speakers to give talks at future meetups, and don't forget to follow @DCNLP (https://twitter.com/DCNLP/) on Twitter!

    The Big Hunt

    1345 Connecticut Ave NW · Washington, DC

    9 comments
  • January: An Intro to Sentiment Analysis
    January is our second month in the space generously shared with us by LawIQ (https://www.linkedin.com/company/lawiq). (It's right between Farragut Square and Dupont Circle.) They're also sponsoring pizza and beer - huge thanks to them. Brian Sacash (https://twitter.com/bsacash) will present an intro to natural language processing through sentiment analysis. Brian is a data scientist at Koverse where he regularly deals with Natural Language Processing. He is also the author of “Introduction to Natural Language Processing (https://www.udemy.com/natural-language-processing/)” on Udemy.com. In this presentation he will demonstrate how to build a basic sentiment analysis engine using Python and the Natural Language Tool Kit. ************************ We'll meet at 6:30pm a new location for us: LawIQ at 1725 DeSales Street, NW Suite 500. We'll do intros around 7:00 and move into the talk right around 7:30. DC NLP meets on the second Wednesday of each month to network, socialize, and learn about the interesting work folks are doing in natural language processing, computational linguistics, text analytics, and more. Do you have something you'd like to share with the group? Let us know! We're always looking for speakers to give talks at future meetups, and don't forget to follow @DCNLP (https://twitter.com/DCNLP/) on Twitter!

    LawIQ

    1725 DeSales Street NW · Washington, DC

    28 comments
  • December: Lightning Talks and Open Mic in a New Space
    New location! We're meeting at a space generously shared with us by LawIQ (https://www.linkedin.com/company/lawiq). (It's right between Farragut Square and Dupont Circle.) They're also sponsoring pizza and beer - huge thanks to them. As we head towards the holidays, Open Mic Night returns to DC NLP! Here's your opportunity to steal the stage for a lightning talk on the topic of your choice. No need for anything fancy, but if you prepare slides or a live demo, we'll get it on the TV screen. For anyone interested in speaking at open mic night, here are a few basic rules: • Everyone who wants to (time permitting) will get 5 to 10 minutes to talk about anything they like. You can give a quick talk about something you've been working on, ask a question about a problem you've been struggling with, run a demo for a interesting tool you've found, etc. • The only rule in terms of content is that it must be related to natural language processing. • You can use slides or a computer, but it's certainly not required. You can just get up in front of the group and ramble on for 5-10 minutes if you like. We won't judge you! Audience interaction is highly encouraged! If you'd like to guarantee yourself a spot, send a note to the organizers or leave a comment below -- walk-ins are also always welcome. ************************ We'll meet at 6:30pm a new location for us: LawIQ at 1725 DeSales Street, NW Suite 500. We'll do intros around 7:00 and move into talks right around 7:30. DC NLP meets on the second Wednesday of each month to network, socialize, and learn about the interesting work folks are doing in natural language processing, computational linguistics, text analytics, and more. Do you have something you'd like to share with the group? Let us know! We're always looking for speakers to give talks at future meetups, and don't forget to follow @DCNLP (https://twitter.com/DCNLP/) on Twitter!

    LawIQ

    1725 DeSales Street NW · Washington, DC

    21 comments
  • Basics of NLP
    This event was canceled.