Skip to content

Online Monthly Prize Draw: Packt Publishing - Free Tech Book

Photo of Ged Byrne
Hosted By
Ged B. and Barry C.
Online Monthly Prize Draw: Packt Publishing - Free Tech Book

Details

Please note this is an online monthly prize draw and no actual meeting will take place

http://photos2.meetupstatic.com/photos/event/b/e/9/c/event_14868796.jpeg (http://www.packtpub.com/)

Data Science

http://www.blogging4jobs.com/wp-content/uploads/2012/12/data-science-history.jpg (http://www.blogging4jobs.com/hr/why-your-next-hr-hire-should-be-a-data-scientist/) The role of Data Scientist is new and important. Big Data is seen as the key area for innovation, and the Data Scientist is a key role in putting Big Data to work.

So who are the Data Scientists, what do they do and does it include skills that we as developers may want to acquire?

The Data Scientist mines data for useful insights. It's a role that is closely related to the Computer Scientist, the role often fulfilled by developers like us.

Last month we looked at the tools used by the Computer Scientist, this month we look at the skills and tools needed by the Data Scientist.

The Data Scientist Role

In his Book "Data Visualisation - a Successful Design Process" Andy Kirk identifies the "Eight Hats" of data visualisation design:

The Initiator - The leader who is seeks a solution. The Data Scientist - The data miner, wearing a miners hat, discovering nuggets of insight buried deep within the numbers. The Journalist - the story teller who refines the insight with narrative and context. The Computer Scientist - The person who breaths life into the project with their breadth of software and programming literacy. The Designer - With an eye for visual detail and a flair for innovation they work with the computer scientist to ensure harmony between form and function. The Cognitive Scientist - Brings an understanding of visual perception, colour theories and human-computer interaction to inform the design process. The Communicator - The negotiator and presenter who acts as the client-customer-designer gateway. The Project Manager - The co-ordinator who picks up the unpopular duties and makes sure that the project is cohesive, on time and on message. These are hats, and we will probably find ourselves wearing several of them over time. As you can see, Data Visualisation requires us to pull together a range of disciplines in order to achieve something meaningful.

Last month we focused on the skills of the Computer Scientist, looking at the skills needed to pull the data out of the repository and put it in front of the audience.

http://ljcbookclub.files.wordpress.com/2013/05/miner-willy.jpg This month we are looking at the skills of the Data Scientist. Here's Kirk's full description:

The data scientist is characterized as the data miner, wearing the miner's hat. They are responsible for sourcing, acquiring, handling, and preparing the data. This means demonstrating the technical skills to work with data sets large and small and of many different types. Once acquired, the data scientist is responsible for examining and preparing the data. In this proposed skill set model, it is the data scientist who will hold the key statistical and mathematical knowledge and they will apply this to undertake exploratory visual analysis to learn about the patterns, relationships, and descriptive properties of the data.

From Chapter 2 of Data Visualisation - A Successful Design Process (http://www.packtpub.com/data-visualization-a-successful-design-process/book)

Last month we talked about data being the new soil. The data scientist is a miner who digs down deep. It is a pivotal roll in the design process. Kirk elaborates further:

If we don't have the data we want, or the data we do have doesn't tell us what we hoped it would, or the findings we unearth aren't as interesting as we wish them to be there is nothing we can (legitimately) do about it. That is an important factor to remember. No amount of 3D-snazzy-cool-fancy-design dust sprinkled on to a project can change that.

An incomplete, error strewn or just plain dull dataset will simply contaminate your visualization with the same properties. So, the primary duty for us now is to avoid this happening, remove all guessing and hoping, and just get on with the task of acquiring our data and immerse ourselves into it to learn about its condition, its characteristics, and the potential stories it contains.

From Chapter 3 of Data Visualisation - A Successful Design Process (http://www.packtpub.com/data-visualization-a-successful-design-process/book)

This month we're going to look at some of the tools we can use as Data Scientists to immerse ourselves in the data. Tools that will help us to interact with our data, drill down into it's seams and discover what nuggets lie within.

See the blog post (http://wp.me/p1FFAt-7Y) on the LJC Book Club for more details.

If you want to have a chance of winning one of this months books then please sign up on the Meetup page. At the end of May the lucky winner will get to choose a physical copy and the runner up can select an ebook.

Data Visualisation: a successful design process (http://www.packtpub.com/data-visualization-a-successful-design-process/book) Learning Highcharts for Javascript Data Visualisation (http://www.packtpub.com/learning-highcharts-for-javascript-data-visualization/book) Data Analysis Cookbook (http://www.packtpub.com/clojure-data-analysis-cookbook/book) Statistical Analysis with R - Beginners Guide (http://www.packtpub.com/statistical-analysis-with-r-beginners-guide/book) MapReduce Cookbook (http://www.packtpub.com/hadoop-mapreduce-cookbook/book) Infinispan Data Grid Platform (http://www.packtpub.com/infinispan-data-grid-platform/book) To take part in the promotion all you have to do is RSVP to this event on Meetup and complete the questions.

Packt Publishing (http://www.packtpub.com/) are a unique publishing company specializing in highly focused books on specific technologies and solutions - please visit their site to find out more about them: www.packtpub.com (http://www.packtpub.com/)

Good luck!

Organised by RecWorks
http://photos3.meetupstatic.com/photos/event/c/5/0/0/event_140270432.jpeg (http://developercareers.recworks.co.uk/online/JobSearchResults.aspx)

Please note this is an online monthly prize draw and no actual meeting will take place

Photo of LJC - London Java Community group
LJC - London Java Community
See more events
This is an online promotion · There will be no physical meetup