Data Science for analytical minds

Introduction to 5-part practical guide on becoming a data scientist.

Are you someone who:

  • Has studied concepts of statistics, econometrics or mathematics?
  • Loves reasoning, creating wild hypothesis and findings flaws in your concepts or theories?
  • Planning to start a career in data science* or looking for ways to accelerate your data science career?

If yes, then you’ve landed on the right 5 part blog series which will provide you with all the practical information that you will need to derive real life value from data.

Is data science meant for me?

I think yes. As an economics post graduate and Principal Data Scientist at Atlan, I look at data science as a beautiful child of economics and computer science, that takes the best from both parents and forms its own shape and personality.

As economists, we are trained to understand and find patterns at all levels — micro and macro — as well as across varied domains such as development economics, agricultural economics, environmental economics, and others. We are trained in converting these patterns into mathematical equations to empirically test and communicate the results with relevant stakeholders.

A data scientist is someone who has the expertise of a trained economist along with the knowledge of working with big data, know-how of efficient usage of computer power and an engineering attitude of killing all repeatable tasks.

If you also feel that you understand some of these components of data science and have a knack for numbers, patterns, and an analytical bent of mind, then you can easily create a place for yourself in Data Science, just like I have. 🙂

What are career prospects in data science?

Data Science is rapidly changing the face of various industries with new innovations. Across organisations, data is now a core function like sales, marketing, or product and design.

Any company that has adopted data science has a competitive advantage over others in the industry because of the vast knowledge and faster learning and course-correction that the business becomes suddenly capable of. According to the 2018 Data Security Confidence Index from Gemalto, 89 percent of the people polled knew that if they could learn to analyse their information properly, they’d have a competitive edge.

In academia, research departments are learning data science and hiring experts to scale their research. Paul Romer, the winner of Nobel Prize in Economics in 2018, moved to Python from STATA and Mathematica to be able to benefit from its ability to handle massive and various types of datasets and gain the opportunity to share his work in the open source community.

Elections are being contested leveraging the power of data science across the world, be it the Brexit campaign or US election campaign done by Cambridge Analytica.

Across the world, this has led to a surge in the demand of data science professionals. In 2019, LinkedIn picked data science as the most promising career.

All this demand is not being catered to by the available supply. Indeed’s January 2019 report showed a 29% increase in demand for data scientists year-on-year and a 344% increase since 2013 — which is a dramatic upswing. But while demand has exploded — in the form of job postings — searches by job seekers skilled in data science has grown at a slower pace (14%), suggesting a vast and growing gap between supply and demand.

How is a day in the life of a data scientist?

Let’s take an example of a data science problem to understand this better.

Imagine that you or your company has been funded to predict monthly malaria cases for the next 2 years for every village and town of India.

Given that a whole host of factors can cause or accelerate the incidence of malaria, you need a lot of diverse data indicators coming in from a whole range of data sources: satellites for weather information, spatial directories for village locations, road network and more, statistical databases in PDF or excel format for hospital directory, income estimates, sanitation records, and more. All these datasets consist of different data types, geography granularity, and frequency of update.

As the data scientist on the job, you should be able to:

  • Do background research on the problem statement and make an execution plan accounting restricted timelines, resources, and skillset of you and your team.
  • Find and curate right datasets mapped to required indicators needed as per the problem statement
  • Process and clean all different data types and bind them together
  • Do explanatory analysis to find the right indicators/features of your prediction/forecasting model.
  • Build a model that works for spatial data, time series data as well as cross sectional data.
  • Write a script that can run every month with minimal human interference completing all steps including data sourcing, data cleaning, master database creation, running prediction model
  • Create a regularly updated visualisation dashboard so that the insights can be consumed and shared with relevant stakeholders.

Looking at the problem statement, most of you, post research on malaria occurrence and prediction, would be able to design the solution and workflow but turning this into reality is the job of a data scientist.

So what all skills make a good data scientist?

Work, work, work…

We will go deep into every component in the upcoming parts of the blog series. A brief for now:

Globally, a data scientist typically spends 80% of their time on data cleaning and preparation. Why is that? It’s because the data scientist has to deal with datasets that are of different data types, file formats, cleaning process, frequency of update, aggregation level, data size, and such other dimensions. Before getting to the analysis or modelling part, there is usually a huge time consuming process of data processing required to make sense of any data. Therefore, the ability to correctly automate data processing is one of the most desired skillsets of any data scientist.

  • Analysis & modelling-

This would involve a deep understanding of statistics and mathematics to build and use various analytical or predictive ML models. Anyone who studies and enjoys mathematics and statistics, knows the basics by heart. Come on, just recite all OLS assumptions in your head now and you will know what I mean.. But, what we’re not taught are advanced ML techniques like XGboost, Support Vector machines, neural nets, and so on. As I set out to learn these techniques on-the-job, I realised that these are just more powerful extensions of the same statistical and mathematical models like logistic regression, linear optimisation, and bayesian probability that we have all studied as a part of our curriculum. We’re planning to go deep into these techniques later in the blog series.

  • Programming languages-

Data science requires a lot of data wrangling, multiple model testing and optimisation along with visualisation to draw right insights, make inferences, do predictions or enable decisions. What’s more is that this needs to be done every day with varying data scale and capacity within the organisation. Therefore, knowing scripting programming languages that can best automate and optimise the work — from data cleaning to visualisation — is essential for any data scientist. R and python are most famously used programming languages used by data scientists but there are a lot more which we will discuss in detail in the coming section. Economists or statisticians are mostly taught STATA and basics of R or python in some universities, which is a good stepping stone but there is a long road ahead in learning how to write R or python scripts that are optimised for agility and speed.

  • Handling data at scale-

Big data and production algorithms call for a highly optimised environment compared to a testing environment where we usually do data cleaning, ML models testing and so on.
This optimised environment comprises choosing the right database or data warehouse to store your data, right machine configuration where script will run, choosing the right libraries and package to do certain tasks in the script, and more such things. Not all these tasks are necessarily performed by data scientists, they are more of a concern for data engineers but as a small company or independent researcher you might not have this luxury, so having a working knowledge will be useful.

  • Detecting patterns & domain knowledge-

These are probably the skills that you should be most proud of when entering the data science field. All other components of data science are tech-heavy but these are more reliant on human intuition and study of the domain, and is still very far from automation. The inherent skill here is to form out of the box hypotheses, look for patterns and back them by data, critically examine every step in the process flow, and connect data and mathematical models to the needs of the industry. These are skills that are inherent in a good economist, mathematician or statistician, and can be put to use to any industry or domain given one is passionate to learn about that domain.

  • Communication & Visualisation-

Communication is a vital skill set for any data scientist. Given the nature of the job, data scientists are required to translate the models or predictions that they’ve built into actionable insights for the non-technical decision makers who will design a course of action on the basis of the data. Good communication is a function of various attributes which we will discuss in the coming article but one tool that really helps with communication is visualisation.

Visualisations come to the rescue when data scientists have to take huge amounts of data and complex models and outputs and make them understandable to an audience with limited time and attention span. As they say, “a picture is worth a thousand words”, and in my opinion, a good visualisation is worth a million. This also comes from the fact that humans inherently understand patterns and symbols better than any other language.

  • Open Source Community-

Data Science community is build on some awesome open source tools, languages, frameworks, and written articles, wikis, blogs and books. Therefore, it is really important that we all contribute back through ways we can- writing blogs, contributing to R, python or any open source language, speaking at seminars, building public data dashboards (like one for COVID19), contributing data to open data repository like OSM or any other way you like.

So, what’s next?

Next up for you, is some reading and hands-on execution. As Thomas Edison famously said-, “Vision without execution is hallucination”.

My co-author Anika Mathur on this blog series and I hope to help you get started on this road of becoming a data scientist. This introductory blog will be followed by 4 more blogs in this series:

  1. Dirty Data- Quality Assessment & Cleaning Measures [Published]
  2. En route to ML- Take statistics & mathematics to next level [In Process]
  3. Scale it up- Programming languages, data storage & collaboration
  4. Spread the knowledge- Communication & Community

Expect these blogs to introduce you to different aspects of data science while giving you trusted resources to start your data science training. Happy hacking!

 

Original post: https://www.datasciencecentral.com/profiles/blogs/data-science-for-analytical-minds-introduction

114 comentários em “Data Science for analytical minds

  1. Hello there! I know this is kinda off topic
    but I was wondering which blog platform are you using for this website?
    I’m getting fed up of WordPress because I’ve had issues with hackers and I’m looking at alternatives for another platform.
    I would be awesome if you could point me in the direction of
    a good platform.

  2. Having read this I thought it was extremely informative.
    I appreciate you finding the time and energy to put this short article together.
    I once again find myself personally spending a significant amount of time both reading and leaving comments.
    But so what, it was still worth it!

  3. Hi I am so glad I found your blog page, I really found you by error, while I was searching on Aol for something else, Nonetheless I am here now and
    would just like to say many thanks for a remarkable post and a all round interesting blog (I also
    love the theme/design), I don’t have time to read through it all at the minute but I have book-marked it and also added your
    RSS feeds, so when I have time I will be back to read much more, Please do keep
    up the awesome b.

  4. Hello there! I could have sworn I’ve visited your blog
    before but after going through a few of the posts I realized it’s
    new to me. Anyhow, I’m certainly pleased I came across it and I’ll be bookmarking it and checking back regularly!

  5. I’m the manager of JustCBD Store brand (justcbdstore.com) and I’m presently planning to expand my wholesale side of business. I am hoping someone at targetdomain give me some advice ! I considered that the most suitable way to do this would be to connect to vape stores and cbd retailers. I was hoping if anybody at all could suggest a trustworthy site where I can get CBD Shops Business Leads I am presently taking a look at creativebeartech.com, theeliquidboutique.co.uk and wowitloveithaveit.com. Unsure which one would be the very best option and would appreciate any assistance on this. Or would it be simpler for me to scrape my own leads? Suggestions?

  6. Today, I went to the beach with my kids. I
    found a sea shell and gave it to my 4 year old daughter and said “You can hear the ocean if you put this to your ear.” She put the shell to her ear and screamed.
    There was a hermit crab inside and it pinched her ear. She never wants to go back!

    LoL I know this is entirely off topic but I had to tell someone!

  7. I’m the proprietor of JustCBD Store company (justcbdstore.com) and I’m presently seeking to expand my wholesale side of business. It would be great if someone at targetdomain give me some advice ! I thought that the most ideal way to accomplish this would be to connect to vape companies and cbd retailers. I was hoping if anybody could suggest a reputable web-site where I can buy CBD Shops B2B Sales Leads I am already taking a look at creativebeartech.com, theeliquidboutique.co.uk and wowitloveithaveit.com. Not sure which one would be the most suitable choice and would appreciate any assistance on this. Or would it be easier for me to scrape my own leads? Ideas?

  8. Good post. I learn something totally new and challenging on blogs I stumbleupon every day. It will always be helpful to read articles from other authors and use a little something from other web sites.

  9. Hello! I simply wish to give you a huge thumbs up for the excellent information you have here on this post. I’ll be coming back to your site for more soon.

  10. May I just say what a comfort to find somebody that genuinely knows what they’re talking about online. You actually understand how to bring a problem to light and make it important. More people really need to check this out and understand this side of your story. I was surprised that you’re not more popular because you certainly have the gift.

  11. Having read this I thought it was rather enlightening. I appreciate you finding the time and energy to put this informative article together. I once again find myself spending a lot of time both reading and posting comments. But so what, it was still worth it!

  12. Hi! I could have sworn I’ve been to this blog before but after browsing through a few of the posts I realized it’s new to me. Regardless, I’m definitely delighted I discovered it and I’ll be bookmarking it and checking back often!

  13. I don’t even understand how I finished up
    here, but I thought this publish was once good. I don’t
    understand who you are but definitely you are going to a famous blogger
    if you happen to aren’t already. Cheers!

  14. I’m impressed, I must say. Rarely do I come across a blog that’s equally educative and engaging, and let me tell you, you’ve hit the nail on the head. The problem is an issue that not enough people are speaking intelligently about. I am very happy I stumbled across this during my hunt for something relating to this.

  15. Nice post. I learn something totally new and challenging on blogs I stumbleupon everyday. It’s always useful to read content from other writers and use something from their sites.

  16. Hi there just wanted to give you a quick heads
    up. The words in your content seem to be running off the screen in Chrome.
    I’m not sure if this is a format issue or something to do with web browser compatibility
    but I figured I’d post to let you know. The design look great though!
    Hope you get the issue fixed soon. Cheers

  17. Your style is very unique compared to other folks I have read stuff from. I appreciate you for posting when you have the opportunity, Guess I’ll just bookmark this site.

  18. Attractive part of content. I just stumbled upon your web site and in accession capital to say
    that I acquire actually enjoyed account your weblog posts.
    Any way I’ll be subscribing for your augment and even I achievement you get entry
    to consistently quickly.

  19. Right here is the right site for anybody who wants to understand this topic. You realize so much its almost tough to argue with you (not that I actually would want to…HaHa). You definitely put a new spin on a topic that’s been discussed for ages. Great stuff, just wonderful!

  20. Good post. I learn something totally new and challenging on websites I stumbleupon everyday. It will always be useful to read through articles from other writers and practice something from other sites.

  21. I’d like to thank you for the efforts you have put in penning this blog. I’m hoping to check out the same high-grade blog posts from you later on as well. In truth, your creative writing abilities has motivated me to get my own, personal website now 😉

  22. I blog frequently and I genuinely thank you for your content. This article has really peaked my interest. I am going to bookmark your blog and keep checking for new details about once per week. I subscribed to your Feed too.

  23. I blog quite often and I truly thank you for your content. The article has really peaked my interest. I’m going to book mark your website and keep checking for new details about once a week. I subscribed to your Feed too.

  24. I need to to thank you for this excellent read!! I certainly enjoyed every bit of it. I have got you bookmarked to check out new things you post…

  25. I need to to thank you for this wonderful read!! I absolutely loved every bit of it. I have you book-marked to look at new stuff you post…

  26. I loved as much as you will receive carried out right here.

    The sketch is attractive, your authored material stylish.
    nonetheless, you command get got an nervousness
    over that you wish be delivering the following. unwell unquestionably come further formerly again as exactly the same
    nearly very often inside case you shield this increase.

    adreamoftrains best hosting

  27. You’re so awesome! I do not believe I’ve read through anything like that before. So good to discover another person with genuine thoughts on this subject. Really.. many thanks for starting this up. This website is one thing that is required on the web, someone with some originality!

  28. Having read this I thought it was extremely informative. I appreciate you taking the time and effort to put this content together. I once again find myself spending way too much time both reading and posting comments. But so what, it was still worth it!

  29. Aw, this was an exceptionally good post. Spending some time and actual effort to create a really good article… but what can I say… I put things off a whole lot and never manage to get nearly anything done.

  30. This is the right blog for anyone who hopes to find out about this topic. You understand so much its almost tough to argue with you (not that I actually would want to…HaHa). You certainly put a fresh spin on a topic which has been discussed for ages. Excellent stuff, just great!

  31. Hi, I do think this is an excellent website. I stumbledupon it 😉 I’m going to return yet again since I book-marked it. Money and freedom is the greatest way to change, may you be rich and continue to help other people.

  32. I blog quite often and I truly appreciate your content. This great article has truly peaked my interest. I’m going to bookmark your website and keep checking for new details about once a week. I opted in for your RSS feed as well.

  33. Howdy, I believe your blog could be having browser compatibility problems. Whenever I look at your site in Safari, it looks fine but when opening in I.E., it has some overlapping issues. I merely wanted to provide you with a quick heads up! Apart from that, great site!

  34. Howdy, I do believe your website may be having web browser compatibility problems. Whenever I look at your site in Safari, it looks fine however, if opening in I.E., it has some overlapping issues. I just wanted to give you a quick heads up! Aside from that, fantastic blog!

  35. You’ve made some really good points there. I looked on the internet to find out more about the issue and found most individuals will go along with your views on this web site.

  36. Oh my goodness! Awesome article dude! Many thanks, However I am having problems with your RSS. I don’t know why I can’t subscribe to it. Is there anyone else having similar RSS issues? Anybody who knows the solution can you kindly respond? Thanks!!

  37. I’m extremely pleased to uncover this web site. I wanted to thank you for ones time for this fantastic read!! I definitely savored every little bit of it and I have you book-marked to see new stuff on your web site.

  38. Oh my goodness! Awesome article dude! Thank you, However I am encountering problems with your RSS. I don’t understand why I am unable to subscribe to it. Is there anyone else having identical RSS problems? Anyone who knows the solution will you kindly respond? Thanks!!

  39. Hi, I do think this is a great web site. I stumbledupon it 😉 I may come back once again since I book-marked it. Money and freedom is the best way to change, may you be rich and continue to help other people.

  40. I’d like to thank you for the efforts you have put in penning this site. I really hope to see the same high-grade blog posts by you in the future as well. In truth, your creative writing abilities has encouraged me to get my own, personal blog now 😉

  41. Achieving your fitness goals doesn’t have to require a certified personal trainer or an expensive gym memberships, it’s not hard to exercise at home. It is easy to go down a training and fitness rabbit hole, however, when you are looking for the best home exercise equipment to outfit your personal home gym.

  42. Hello! Do you know if they make any plugins to assist with Search Engine
    Optimization? I’m trying to get my blog to rank for some targeted keywords but I’m not seeing very good success.
    If you know of any please share. Cheers! y2yxvvfw cheap flights

  43. Hey I know this is off topic but I was wondering if you knew of any widgets I could
    add to my blog that automatically tweet my newest twitter
    updates. I’ve been looking for a plug-in like this
    for quite some time and was hoping maybe you would have some experience with something like this.
    Please let me know if you run into anything.
    I truly enjoy reading your blog and I look forward to your new updates.

    cheap flights 32hvAj4

  44. After looking into a few of the blog posts on your site, I really like your technique of writing a blog. I book marked it to my bookmark website list and will be checking back in the near future. Please check out my website as well and tell me what you think.

  45. Hello there! This article couldn’t be written any better! Looking at this article reminds me of my previous roommate! He always kept preaching about this. I will forward this information to him. Fairly certain he will have a very good read. Thank you for sharing!

  46. You actually make it seem so easy together with your presentation however I find this topic
    to be really something that I believe I might by no means understand.
    It sort of feels too complicated and very huge for me. I’m looking forward to your
    subsequent put up, I will try to get the hold of it!

  47. When I originally commented I appear to have clicked on the -Notify me when new comments are added- checkbox and now each time a comment is added I recieve four emails with the same comment. There has to be an easy method you are able to remove me from that service? Thanks!

  48. I’m extremely pleased to find this page. I wanted to thank you for your time due to this fantastic read!! I definitely appreciated every bit of it and I have you book marked to see new things in your site.

  49. Right here is the perfect blog for everyone who wishes to understand this topic. You realize so much its almost hard to argue with you (not that I really will need to…HaHa). You definitely put a brand new spin on a subject that’s been written about for ages. Excellent stuff, just excellent!

  50. An outstanding share! I’ve just forwarded this onto a co-worker who has been conducting a little homework on this. And he actually ordered me dinner due to the fact that I stumbled upon it for him… lol. So let me reword this…. Thanks for the meal!! But yeah, thanks for spending some time to talk about this issue here on your site.

  51. Right here is the perfect website for anyone who would like to find out about this topic. You realize so much its almost tough to argue with you (not that I actually will need to…HaHa). You definitely put a new spin on a subject that’s been discussed for many years. Wonderful stuff, just great!

  52. After looking into a few of the articles on your website, I really like your technique of writing a blog. I book marked it to my bookmark site list and will be checking back in the near future. Please visit my web site as well and tell me how you feel.

  53. Right here is the right web site for anybody who really wants to understand this topic. You know so much its almost tough to argue with you (not that I personally will need to…HaHa). You definitely put a brand new spin on a subject which has been discussed for many years. Wonderful stuff, just wonderful!

  54. I was very pleased to find this page. I wanted to thank you for ones time for this particularly wonderful read!! I definitely loved every part of it and I have you bookmarked to check out new things in your site.

Leave a Reply

Your email address will not be published. Required fields are marked *