Pythondata
Data Science activities
- Activities done in data science work, and how they typically occur
- Problem formulation
- Data acquisition
- Data exploration
- Analysis
- Communication of results
- Take action automatically
- Data and metadata
- Data meaning, Data structure (schema), Provenance
- Ethics in data science work
python
data error (5B)
Data Cleaning - A process where the data scientist aims to get a dataset which has fixed the data quality problems in source data
logic data (6A)
ML (9A)
- Output of the training process is a computer system that includes a predictive model
- Supervised learning
- Train-validate-test
- Underfitting and overfitting
- K-nearest-neighbour regression
- Find the choice of one splitting attribute and thresholds that produces the “lowest impurity” division
11 A
- Clustering: k-means clustering
- Content-based
Written on December 6, 2020