San Francisco Hadoop Users Message Board › Feb 2011 - Hadoop and RDBMS

Feb 2011 - Hadoop and RDBMS

A former member
Post #: 1
* Hadoop is not SQL
* Hadoop learning curve is about 3-4 months
* RDBMS + Hadoop useful together for time-based hierarchical storage
* Storage rules of thumb:
** Low value (raw, experimental) data in Hadoop
** High volume data should be in Hadoop
** RDBMS: high processing latency, and scales poorly
** But RDBMS is better for interactive queries
* Tools:
** Sqoop
** Hive
** Custom tools to import data back into RDBMS
** Custom tools to generate Hive schemas from SQL schemas
* Goals:
** Move data into Hadoop
** Translate/simulate SQL queries as Hadoop jobs
** Move data back into RDBMS for integration with other systems
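
To make the "custom tools to generate Hive schemas from SQL schemas" item a bit more concrete, here is a minimal sketch of what such a tool might look like. It is not the tool discussed at the meetup: the names `SQL_TO_HIVE`, `hive_type`, and `hive_ddl` are hypothetical, the type mapping is illustrative and deliberately conservative (dates and unknown types fall back to STRING, DECIMAL is mapped lossily to DOUBLE), and the tab-delimited text storage is an assumption chosen only because it is a common layout for imported data.

```python
import re

# Hypothetical, illustrative mapping from common SQL column types to Hive types.
# Date/time types are mapped to STRING as a conservative assumption.
SQL_TO_HIVE = {
    "TINYINT": "TINYINT",
    "SMALLINT": "SMALLINT",
    "INT": "INT",
    "INTEGER": "INT",
    "BIGINT": "BIGINT",
    "FLOAT": "FLOAT",
    "DOUBLE": "DOUBLE",
    "DECIMAL": "DOUBLE",  # lossy; an assumption made for this sketch
    "CHAR": "STRING",
    "VARCHAR": "STRING",
    "TEXT": "STRING",
    "DATE": "STRING",
    "DATETIME": "STRING",
    "TIMESTAMP": "STRING",
}


def hive_type(sql_type):
    """Map one SQL column type, e.g. 'VARCHAR(255)', to a Hive type name."""
    # Drop any length/precision suffix such as '(255)' or '(10,2)'.
    base = re.sub(r"\(.*\)", "", sql_type).strip().upper()
    # Anything not in the table falls back to STRING.
    return SQL_TO_HIVE.get(base, "STRING")


def hive_ddl(table, columns):
    """Build a Hive CREATE TABLE statement from (name, sql_type) pairs."""
    cols = ",\n".join(
        "  {} {}".format(name, hive_type(sql_type)) for name, sql_type in columns
    )
    return (
        "CREATE TABLE IF NOT EXISTS {} (\n{}\n)\n".format(table, cols)
        + "ROW FORMAT DELIMITED FIELDS TERMINATED BY '\\t'\n"
        + "STORED AS TEXTFILE;"
    )


if __name__ == "__main__":
    # Hypothetical source table, for illustration only.
    print(hive_ddl("events", [("id", "BIGINT"), ("name", "VARCHAR(64)"),
                              ("created_at", "DATETIME")]))
```

A real tool would read the column list from the source database's catalog rather than from a hand-written list, and would need a policy for types that have no clean Hive equivalent.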