Summary: Multivariate Statistics And Machine Learning
- This + 400k other summaries
- A unique study and practice tool
- Never study anything twice again
- Get the grades you hope for
- 100% sure, 100% understanding
Read the summary and the most important questions on multivariate statistics and machine learning
-
1 basics and matrices
This is a preview. There are 53 more flashcards available for chapter 1
Show more cards here -
What is multivariate statistics?
a subdivision of statistics encompassing the simultaneous observation and analysis of more than one outcome variable. -
What is the use of a statistical model in psychology?
- empirical sciences are fraught with uncertainty about relations between variables.
- statistical models help us to quentify this uncertainty
- it helps us to reason about the relations between variables
- it helps us to predict what happens when things change.
-
What can we deduce from a linear model?
- The expected variance of y
- the expected values of y
- the expected covariance of y
- The expected variance of y
-
What is the covariance of two variables?
Basically the correlation, but unstandardized for the sd's of both variables.- if sd of x = sd of y, cor(x,y) = cov(x,y)
-
What is the difference between causal and descriptive models?
- Statistical models may be based on causal reasoning, then the model describes the theory about why two variables are related
- statistical models can be descriptive, then they simply give a summary of the data
- Statistical models may be based on causal reasoning, then the model describes the theory about why two variables are related
-
What are the characteristics of statistical models in multivariate statistics?
- Multivariate data require more sophisticated statistical models, models with multiple independent and dependent variables
- statistical analysis among others are;
- Latent variable models (Factor Analysis), PCA
- Multivariate (normal) probability distribution
- Clustering
- Almost always focused on means and (co-)variances
-
What are the differences between statistics and machine learning?
Statistics vs machine learningCausal modeling -Descriptive modeling
- Focus on inference (
H0 vsHA ) - focus onprediction accuracy Interpretation of modelparameters - Meaning ofparameters ignored
Smaller data sets - large data sets- need for
statistical power Concerned about chance model -Concerned aboutprediction accuracy
-
What is the basic tactic for machine learning?
- The data is split in a training set, on which the model is created, and a training set, used to test the model's accuracy
- this concept is called cross validation, there are different kinds of cross validation.
- The data is split in a training set, on which the model is created, and a training set, used to test the model's accuracy
-
What are the two rules of calculating with the expected value?
- if Y = bX then E(Y) = E(b*X) = b*E(X)
- If Y = X + a then E(Y) = E(X + a) = E(X) + a.
-
What are the rules for simplifying calculations with the expected values when x is squared or a different function?
No rules for other transformations, e.g., E(X^2) or E[g(X)], exist.
- Higher grades + faster learning
- Never study anything twice
- 100% sure, 100% understanding