What do I need to know to be a Data Scientist?
- You need to understand data: know how to explore it and how to apply statistical and analytical techniques.
- You need to be able to query data sets and manipulate them into the required formats using Transact-SQL.
- You need to be able to present data in a meaningful way by using tools such as Excel or Power BI.
- You need to understand statistics and its role in gaining insights from data.
- You need to know how to use a statistical programming language such as R or Python.
- You need to be able to perform data transformation, cleansing, and some statistical analysis.
- You must understand data science concepts such as machine learning, algorithms, and conditional probability.
- You must be able to create machine learning models and evaluate them.
- You must be able to use machine learning to generate predictions and solve problems.
- You must learn how to use tools and technologies such as Microsoft Azure HDInsight, Scala, and Spark.
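
To make a few of these skills concrete, here is a minimal Python sketch of cleansing, a summary statistic, a conditional probability, and a basic model evaluation. Everything in it is an assumption made for illustration: the records are made up, and the names `raw`, `clean`, and `predict`, along with the 4-hour threshold, are hypothetical. The threshold rule only stands in for a real machine learning model.

```python
# Hypothetical records: (hours_studied, passed). None marks a missing value.
raw = [
    (2.0, False), (None, True), (5.5, True), (1.0, False),
    (6.0, True), (4.5, False), (7.0, True), (3.0, False),
]

# Cleansing: drop records whose feature value is missing.
clean = [(hours, passed) for hours, passed in raw if hours is not None]

# Exploration: a simple summary statistic over the cleaned data.
avg_hours = sum(hours for hours, _ in clean) / len(clean)

# Conditional probability: P(passed | hours >= 4), estimated from counts.
high = [passed for hours, passed in clean if hours >= 4]
p_pass_given_high = sum(high) / len(high)

# A trivial threshold rule standing in for a trained model (assumption).
def predict(hours, threshold=4.0):
    return hours >= threshold

# Evaluation: the fraction of records the rule classifies correctly.
accuracy = sum(predict(hours) == passed for hours, passed in clean) / len(clean)

print(f"records kept: {len(clean)}")
print(f"average hours: {avg_hours:.2f}")
print(f"P(pass | hours >= 4): {p_pass_given_high:.2f}")
print(f"accuracy: {accuracy:.2f}")
```

In practice you would do the same steps with a library such as pandas or scikit-learn and evaluate on data the model has not seen, but the underlying ideas are the same.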