What a good modern data analyst has to master

Andriy Burkov, ML at Gartner and author of The Hundred-Page Machine Learning Book, shares what a good modern data analyst has to master:

  • Data structures (local and distributed)
  • Data indexing
  • Data privacy and anonymization
  • Data lifecycle management
  • Data transformation (deduplication, handling outliers and missing values, dimensionality reduction)
  • Data analysis (experiment design, classification, regression, unsupervised methods)
  • Machine learning methods (feature engineering, regularization, hyperparameter tuning, ensemble methods, and neural networks)
  • Computer and database programming, numerical optimization
  • Distributed data processing
  • Real-time and high-frequency data processing
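
To make the data transformation item concrete, here is a minimal sketch of three of the steps it names: deduplication, filling in missing values, and handling outliers. The function name `transform`, the `(id, value)` record layout, median imputation, and clipping to Tukey fences (1.5 × IQR) are all illustrative assumptions, not methods taken from the post; real pipelines may reasonably choose differently.

```python
from statistics import median, quantiles


def transform(records):
    """Clean (id, value) pairs; a value of None marks a missing reading.

    Hypothetical helper for illustration only.
    """
    # Deduplication: keep the first record seen for each id.
    seen = {}
    for rec_id, value in records:
        seen.setdefault(rec_id, value)

    # Missing values: impute with the median of the observed values.
    observed = [v for v in seen.values() if v is not None]
    fill = median(observed)
    filled = {k: (fill if v is None else v) for k, v in seen.items()}

    # Outliers: clip to the Tukey fences, 1.5 * IQR beyond the quartiles.
    q1, _, q3 = quantiles(observed, n=4)
    iqr = q3 - q1
    low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    return {k: min(max(v, low), high) for k, v in filled.items()}
```

Called on a list such as `[(1, 10.0), (1, 10.0), (2, None), (3, 11.0), ...]`, it returns one cleaned value per id. Clipping keeps every record while limiting the influence of extreme values; dropping outliers instead is an equally valid choice when they are known to be errors.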
