Science, Python, R, and "Py-lingualism" -- PyHou: Houston Python Meetup

Hello Pythoneers!

Scott Malec will discuss how the data science arena can benefit from greater "py-lingualism" and general interoperability, and share "in the trenches" impressions and experiences with the strengths and shortcomings of Python, R, and other languages/platforms/ecosystems (e.g., general-purpose vs. more specialized, R&D-focused vs. production-ready) for data science.

Our meetup is located at the Stag's Head Pub, near Richmond and Shepherd. Please enjoy their menu and drink offerings, so they continue to enjoy hosting us! Lastly, please RSVP (even if you're a "Maybe"!) to assist us with capacity planning.

Share your Lightning Talks! A "lightning talk" is a quick mini-presentation (5 minutes maximum) on any Python topic you'd like.  We'll save time for 2-3 of these each month: to save a spot, post a comment with your talk's title/topic. Beginner topics are always welcome!

Presentations Wanted: If you'd like to share a topic with the group at an upcoming meetup, let's talk!

  • Micah S.

    http://hpl.lib.tx.us/learn-expl... Learn to program using Python and make things in Minecraft - the class that Neeraj teaches, I believe

    4 · December 16

  • Micah S.

    http://epiphanet.uth.tmc.edu the project Scott mentioned

    December 16

    • Scott M.

      WORKFLOW: drug: thiazolidinediones, side-effect: acute_edema. Then click on 'GET_PATHWAYS'. Then click on the radio button next to STIMULATES | PRODUCES | ASSOCIATED_WITH-INV and behold the d3.js forced DAG.

      1 · December 17

    • Scott M.

      You can then click on the predicates to see the list of publications that contain that predicate, along with the sentence in each publication (MEDLINE) from which the predicate was extracted.

      1 · December 17

  • Scott M.

    Some R goodness/tips&tricks: http://paulbutler.org/archives/...

    1 · December 17

  • Micah S.

    http://stackoverflow.com/questi... what is an RDF triple?

    December 16

    • Scott M.

      explanation of what a triple/predication is within a bioinformatics context: http://www.sciencedir...

      1 · December 16

    • Micah S.

      'Our approach, named Predication-based Semantic Indexing, is utilized to identify empirically sequences of relationships known as “discovery patterns”, such as “drug x INHIBITS substance y, substance y CAUSES disease z” that link pharmaceutical substances to diseases they are known to treat'

      December 16
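A rough sketch of the "discovery pattern" idea from the quote above: predications stored as subject/predicate/object triples, then chained where the object of one triple is the subject of the next. This is a plain dictionary lookup, not Predication-based Semantic Indexing itself. The `Triple` class, the `two_step_paths` helper, and the placeholder entity names are all hypothetical, chosen only to mirror the "drug x INHIBITS substance y, substance y CAUSES disease z" wording.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    """One predication: subject --predicate--> obj (names are illustrative)."""
    subject: str
    predicate: str
    obj: str


def two_step_paths(triples, first_predicate, second_predicate):
    """Find pairs (t1, t2) where t1.obj == t2.subject and predicates match."""
    # Index the candidate second hops by their subject for quick lookup.
    by_subject = {}
    for t in triples:
        if t.predicate == second_predicate:
            by_subject.setdefault(t.subject, []).append(t)

    paths = []
    for first in triples:
        if first.predicate == first_predicate:
            for second in by_subject.get(first.obj, []):
                paths.append((first, second))
    return paths


# Placeholder data only; these are not real biomedical assertions.
triples = [
    Triple("drug_x", "INHIBITS", "substance_y"),
    Triple("substance_y", "CAUSES", "disease_z"),
    Triple("drug_w", "STIMULATES", "substance_y"),
]
paths = two_step_paths(triples, "INHIBITS", "CAUSES")
```

Here `paths` holds the single chain linking `drug_x` to `disease_z`; the `STIMULATES` triple is ignored because its predicate does not match the first hop.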

  • Micah S.

    https://www.kickstarter.com/proj... a Kickstarter project - high speed internet at the Stag's Head

    1 · December 16

  • Micah S.

    http://sociology.emory.edu/home... ROBERTO FRANZOSI (BA literature, University of Genoa 1975; PhD Sociology, Johns Hopkins 1981) is professor of Sociology and Linguistics.

    1 · December 16

  • FP

    Just got here

    1 · December 16

  • Micah S.

    https://txrxlabs.org/classes/int... Intro to Programming with Python at Tx/Rx - the course that Kojo is teaching

    3 · December 16

  • Micah S.

    https://en.wikipedia.org/wiki/Da... David Everett Rumelhart was an American psychologist who made many contributions to the formal analysis of human cognition, working primarily within the frameworks of mathematical psychology, symbolic artificial intelligence, and parallel distributed processing. He also admired formal linguistic approaches to cognition, and explored the possibility of formulating a formal grammar to capture the structure of stories.

    1 · December 16

  • Mohan R.

    Text tiling Python source: http://www.nltk.org/_modules/nl...

    1 · December 16

  • Micah S.

    http://janav.wordpress.com/2013... Tf-Idf and Cosine similarity // we can find out the similarity between any two documents.

    December 16

    • Micah S.

      "The cosine measure similarity is another similarity metric that depends on envisioning user preferences as points in space. Hold in mind the image of user preferences as points in an n-dimensional space. Now imagine two lines from the origin, or point (0,0,…,0), to each of these two points. When two users are similar, they’ll have similar ratings, and so will be relatively close in space—at least, they’ll be in roughly the same direction from the origin. The angle formed between these two lines will be relatively small...

      December 16

    • Micah S.

      ...In contrast, when the two users are dissimilar, their points will be distant, and likely in different directions from the origin, forming a wide angle. This angle can be used as the basis for a similarity metric in the same way that the Euclidean distance was used to form a similarity metric. In this case, the cosine of the angle leads to a similarity value. If you’re rusty on trigonometry, all you need to remember to understand this is that the cosine value is always between –1 and 1: the cosine of a small angle is near 1, and the cosine of a large angle near 180 degrees is close to –1. This is good, because small angles should map to high similarity, near 1, and large angles should map to near –1."

      December 16

  • Micah S.

    http://pyevolve.sourceforge.net... ...now what we are going to learn is how to use a well-known similarity measure (Cosine Similarity) to calculate the similarity between different documents.

    December 16

  • Micah S.

    https://en.wikipedia.org/wiki/Co... Cosine similarity is a measure of similarity between two vectors of an inner product space that measures the cosine of the angle between them.

    December 16
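For anyone who wants to try the tf-idf plus cosine similarity idea from the links above, here is a minimal sketch in plain Python. It assumes a naive whitespace tokenizer and one common smoothed idf formula, `log((1 + N) / (1 + df)) + 1`; the linked posts may use different variants. The function names and the three sample documents are made up for illustration.

```python
import math
from collections import Counter


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse vectors stored as dicts."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # convention chosen here: an empty vector matches nothing
    return dot / (norm_a * norm_b)


def tf_idf_vectors(docs):
    """Turn raw strings into {term: weight} dicts (whitespace tokenizer)."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents does each term appear?
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        vectors.append({
            term: (count / len(tokens))
            * (math.log((1 + n_docs) / (1 + doc_freq[term])) + 1)
            for term, count in counts.items()
        })
    return vectors


# Made-up sample documents, purely for illustration.
docs = [
    "python is great for data science",
    "r is great for statistics",
    "the pub serves fish and chips",
]
vecs = tf_idf_vectors(docs)
sim_01 = cosine_similarity(vecs[0], vecs[1])  # share "is", "great", "for"
sim_02 = cosine_similarity(vecs[0], vecs[2])  # share no terms
```

Because tf-idf weights are never negative, document similarities computed this way fall between 0 and 1; the full range of -1 to 1 mentioned in the quote only shows up when vectors can have negative components, such as centered ratings.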

  • FP

    Would it be rude to drop in late?

    December 16

    • Micah S.

      of course not. come on down!

      December 16

  • Scott M.

    Here is a draft version of the slides from Scott Malec, tomorrow's humble speaker: https://www.dropbox.com/s/yf5bfs...

    2 · December 15

    • Scott M.

      I am creating a GitHub repo so that you all can follow along with code/data examples. Will include syntax for all major platforms, but you will have to adjust the paths for your own respective preferences: https://github.com/kin...

      1 · December 16

  • Neeraj T.

    This seems like a really interesting topic. I've used both Python and R for data science too, looking forward to the discussion.

    2 · December 9

    • Neeraj T.

      R, on the other hand, will get pissed off if you try to do this.

      December 9

    • Mark H.

      I've worked with R and Pandas a bit, and I remember coming across the fact that Pandas doesn't distinguish between NaN and Null values, whereas R does. Correct me if I'm wrong here. But anyway, there are ways of weeding out NaNs regardless.

      1 · December 10
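On the NaN vs. null point: plain Python itself keeps the two apart (`None` is an object, while `float("nan")` is a float that does not even compare equal to itself), and pandas' `isna` reports both as missing. Below is a small pandas-free sketch of weeding out both in one pass. The `is_missing` and `drop_missing` helpers and the `readings` list are hypothetical names for illustration, not part of any library.

```python
import math


def is_missing(value):
    """True for None and for a float NaN; False for everything else."""
    return value is None or (isinstance(value, float) and math.isnan(value))


def drop_missing(values):
    """Return a new list without None or NaN entries."""
    return [v for v in values if not is_missing(v)]


nan = float("nan")

# Made-up sample values; 0.0 is a real reading and must survive the filter.
readings = [1.5, None, nan, 0.0, 2.5]
clean = drop_missing(readings)
```

Note that filtering on truthiness (`if v`) would wrongly drop `0.0` and keep `nan`, which is why the check is spelled out explicitly.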
