addressalign-toparrow-leftarrow-rightbackbellblockcalendarcameraccwcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-checkcircle-with-crosscircle-with-pluscontroller-playcrossdots-three-verticaleditemptyheartexporteye-with-lineeyefacebookfolderfullheartglobegmailgooglegroupshelp-with-circleimageimagesinstagramFill 1light-bulblinklocation-pinm-swarmSearchmailmessagesminusmoremuplabelShape 3 + Rectangle 1ShapeoutlookpersonJoin Group on CardStartprice-ribbonprintShapeShapeShapeShapeImported LayersImported LayersImported Layersshieldstartickettrashtriangle-downtriangle-uptwitteruserwarningyahoo

Re: [ljc] Opensource Identity Resolution software query

From: @sleepyfox
Sent on: Monday, February 6, 2012 3:07 PM
The problem of person-identification and de-duplication of personal profile data is all part of master data management and is non-trivial. I encountered the problem ten years ago in the Healthcare space and we (GE) gave up on solving it internally in the application and sub-contracted the work to specialists that do just this specific thing as their sole business. If you have a sufficient volume of information then you probably will find sub-ing the work as the most cost-effective solution. If not, then you can get a person to do it and you'll probably get better-quality output than you will by making a half-assed pass it doing it programatically. 

The situation *may* have changed with the cheap availability of ML, I'd be interested to hear of anyone else's more recent experience in this area.


On 6 February[masked]:33, Saiprasad Krishnamurthy <[address removed]> wrote:
Hi Folks,

                Is there an opensource software available for Identity Resolution that does fuzzy,wild-card,phonetic name matches for different locales. I was looking at SOLR and it looks promising as a fast search server using a bunch of search strategies provided off the shelf. But not quite sure if it will fit into my use-case. Really excited to know your comments on this.


Please Note: If you hit "REPLY", your message will be sent to everyone on this mailing list ([address removed])
This message was sent by Saiprasad Krishnamurthy ([address removed]) from LJC - London Java Community.
To learn more about Saiprasad Krishnamurthy, visit his/her member profile
Set my mailing list to email me As they are sent | In one daily email | Don't send me mailing list messages

Meetup, PO Box 4668 #37895 New York, New York[masked] | [address removed]

Our Sponsors

  • Our Blog

    Read the latest news from the LJC

  • RecWorks Ltd

    Fixing Tech Recruitment using the Power of Community

  • jClarity

    Java/JVM Performance Analysis Tools & mentoring for Java related matters

  • LJC Aggrity

    Our LJC Aggrity site contains blog posts from our members

  • LJC Book Club

    Our Book club with book reviews from our members

  • Devoxx UK

    Java Community Conference in collaboration with the LJC, 8-10th June 16

  • SkillsMatter

    "Host, help organise, promote, film many of our meetings."

  • IBM

    Build Enterprise-grade apps at start-up speed.

  • New Relic

    New Relic makes sense of billions of metrics a day in real time.

  • Hazelcast

    Hazelcast is the leader in operating in-memory computing.

  • Java.Net

    We are an official Java User Group recognised by Oracle's JUG program

  • JRebel

    Free 3 month J-Rebel license.

  • O'Reilly

    40% discount on printed books and 50% on e-books.

People in this
Meetup are also in:

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy