Andriy Burkov, ML at Gartner and author of The Hundred-Page Machine Learning Book, shares what a good modern data analyst has to master:
- Data structures (local and distributed)
- Data indexing
- Data privacy and anonymization
- Data lifecycle management
- Data transformation (deduplication, handling outliers, and missing values, dimensionality reduction)
- Data analysis (experiment design, classification, regression, unsupervised methods)
- Machine learning methods (feature engineering, regularization, hyperparameter tuning, ensemble methods, and neural networks)
- Computer and database programming, numerical optimization
- Distributed data processing Real-time and high-frequency data processing