8th BigData/DataScience Meetup(ElasticSearch, HDFS and AWS)


Details
Here we are with the schedule for the next meetup !
We have three very interesting presentations about main HDFS (https://hadoop.apache.org/docs/r2.7.2/hadoop-project-dist/hadoop-hdfs/HdfsUserGuide.html) concepts and architecture, how to use AWS (https://aws.amazon.com/) to build a cluster ready for bigdata projects and elegant searches using ElasticSearch (https://www.elastic.co/)
Schedule of the meetup :
18:00 - 18:30
Food, drinks and time to meet new/old friends !
18:30 - 19:15
Elasticsearch by Andreea Hazi (https://ro.linkedin.com/in/andreea-hazi-126a4495/en) (SDL)
Processing data in real-time.
Introducing Elasticsearch's killer features.
Example of storing, searching & analyzing data.
Talking about use cases: Logstash, Elasticsearch, Kibana - the perfect log processing solution
19:15 - 19:55
HDFS by Tudor Lapusan (https://ro.linkedin.com/in/tudor-lapusan-5902593b) (Telenav)
HDFS is Hadoop distributed filesystem.
You would like to have as much data as HDFS can handle ! It can scale from tens, hundreds or even thousands of servers, being capable to store petabytes of data. From this presentation you will learn about HDFS main characteristics, its architecture and use-cases where it can/cannot be used.
19:55 - 20:05
Break
20:05 - 20:45
Automatic setup of EMR/Hadoop clusters in AWS by Bogdan Lupuț (betfair)
In this presenatation we'll use a combination of Python, shell scripting and Ansible to set up Hadoop clusters in AWS.
As a result the cluster will be entirely described in code, so it can be treated as such. It can be stored in a versioning system like Git, forked and shared with others.
Many thanks betfair (http://www.betfairromania.ro/) for hosting this meetup !

8th BigData/DataScience Meetup(ElasticSearch, HDFS and AWS)