Data Analysis: What are the skills needed to become a data analyst?
2012-11-21 17:20
435 查看
今天看到一个讨论贴:如何成为一个牛逼的数据分析师?其中有一贴如此总结。全文如下:
There are two classes of skills that are needed to be a successful data analyst: both soft and technical skills are needed. The core work flow for a data analyst is several fold. Once a problem has been defined, and a hypothesis is to be tested, the data must be drawn out and then analyzed. The resulting analysis is written up and communicated to the interested stake holder. In order to do this there are several hard and soft skills that are required.
Technical Skills:
A basic knowledge of statistics to a rigorous understanding of Machine Learning. Most consumers of analysis will not look at more than descriptive analysis (means, medians, significance).
Computer skills that are useful are a Querying Language (SQL,Hive,Pig), a scripting Language (Python,Matlab), a Statistical Language (R, SAS, SPSS), and a Spreadsheet (Excel).
Soft Skills
Defining the problem and narrowing the analysis down often requires a lot of soft skills. Balancing the demands on your time to reduce infinite what-if scenarios and understanding the requestors needs requires good communication and understanding of the business needs. Avoid agreeing to delivering too much information that will be not useful to solving the core issues.
Knowing the audience. There is a different presentation required for a PM or a CEO. As a Data Analyst, you will be often required to answer to both. A typical PM will want a more collaborative interaction with more scenarios spelled out and a less polished presentation. A CEO will often be looking for a specific recommendation in a small polished presentation.
Delivery. Having a wonderfully accurate predictive model, that has been backtested to deliver a low RMSE, or an AB test that can increase conversion 15% without reducing sales price are all great results. However, without a great presentation key findings may be left out of product road maps and in the backlog for months or years.
看完后,总结一下作者的意思,无非两层:一是要有干货,二是要有思维。干货包括对机器学习的理解,通一门查询语言(SQL,Hive或者Pig),通一门脚本语言(python或者Matlab),通一门统计语言(R,SAS或者SPSS),通一款软件(Excel)。思维包括, 明确问题的核心,理解客户心声,最后一个没读懂!英文有限,翻译有误,请找有道。
There are two classes of skills that are needed to be a successful data analyst: both soft and technical skills are needed. The core work flow for a data analyst is several fold. Once a problem has been defined, and a hypothesis is to be tested, the data must be drawn out and then analyzed. The resulting analysis is written up and communicated to the interested stake holder. In order to do this there are several hard and soft skills that are required.
Technical Skills:
A basic knowledge of statistics to a rigorous understanding of Machine Learning. Most consumers of analysis will not look at more than descriptive analysis (means, medians, significance).
Computer skills that are useful are a Querying Language (SQL,Hive,Pig), a scripting Language (Python,Matlab), a Statistical Language (R, SAS, SPSS), and a Spreadsheet (Excel).
Soft Skills
Defining the problem and narrowing the analysis down often requires a lot of soft skills. Balancing the demands on your time to reduce infinite what-if scenarios and understanding the requestors needs requires good communication and understanding of the business needs. Avoid agreeing to delivering too much information that will be not useful to solving the core issues.
Knowing the audience. There is a different presentation required for a PM or a CEO. As a Data Analyst, you will be often required to answer to both. A typical PM will want a more collaborative interaction with more scenarios spelled out and a less polished presentation. A CEO will often be looking for a specific recommendation in a small polished presentation.
Delivery. Having a wonderfully accurate predictive model, that has been backtested to deliver a low RMSE, or an AB test that can increase conversion 15% without reducing sales price are all great results. However, without a great presentation key findings may be left out of product road maps and in the backlog for months or years.
看完后,总结一下作者的意思,无非两层:一是要有干货,二是要有思维。干货包括对机器学习的理解,通一门查询语言(SQL,Hive或者Pig),通一门脚本语言(python或者Matlab),通一门统计语言(R,SAS或者SPSS),通一款软件(Excel)。思维包括, 明确问题的核心,理解客户心声,最后一个没读懂!英文有限,翻译有误,请找有道。
相关文章推荐
- What are the best talks/lectures related to big data/algorithms/machine learning?
- What are the most important data structure and algorithms to prepare for Google Interview?
- What are the Data types of the external layer?
- What to do when the Chinese Characters are messed up when extracting from zip archive?
- server 2 task(s) are sleeping waiting for space to become available in the log segment for database tempdb.
- What are the numbers that every computer engineer should know, according to Jeff Dean?
- What are the practical differences between SVD and wavelet transforms in data analysis?
- What are some of the differences between using recursion to solve a problem versus using iteration?
- 【转】What CLASSPATH settings are needed to run JUnit?
- add cool toolbar 256corler is easy,only two file and onCreate func is needed also the MainFrame.toolbar must resize to what real need and also the load width is same to it.
- What are the lesser known but useful data structures
- 73 Which two are the prerequisites to enable Flashback Data Archive? (Choose two.) A. Database must
- What are the lesser known but useful data structures?
- SQL The data types ntext and varchar are incompatible in the not equal to operator 错误
- MISCONF Redis is configured to save RDB snapshots, but is currently not able to persist on disk. Commands that may modify the data set are disabled. Please check Redis logs for details about the error
- What are the best ways to learn self discipline?
- What are the best ways to determine what port an application is using?
- What are the lesser known but useful data structures?
- What skills are needed for machine learning jobs
- MISCONF Redis is configured to save RDB snapshots, but is currently not able to persist on disk. Commands that may modify the data set are disabled. Please check Redis logs for details about the error