Jonathan Cornelissen tells us about DataCamp, the need for data scientists, and how to become one yourself. We also learn about some popular languages and libraries for analyzing data.
"If you're in a healthcare company you might want to try to predict the effectiveness of new drugs."
"If you're a dating site you may want to analyze data to predict which people will be a good match."
"You're being experimented with all the time as an internet user."
"Everybody with a STEM background willing to put in some effort can pretty quickly learn the basics."
"Eventually most companies will have some sort of data science department."
"Big data literally means a lot of data. The bigger trend is data science and big data is a subset."
"There's a huge demand for people who combine programming with knowledge of statistics and business."
"The median wage for somebody with a title of data scientist is between $105k and $144k."
"There will be a shortage of about 200,000 data scientists by 2018. And that's in the US alone."
"More and more people are trying to analyze data online, which is very challenging technically."
"R is the most popular statistical computing language; now gaining traction in the business world."
"What makes R super powerful is its community which has developed enormous amounts of functionality."
"You have a base R version and you just load in packages just like you would load in a gem in Ruby."
"Many people continue with Python to analyze their data just because it's something they know."
"You need to understand what you're looking for in data and you need to interpret what comes out."
"Often people see the value of analyzing data and Excel just doesn't do the work anymore."