 MACHINE LEARNING FOR ECONOMICS | Università degli studi di Bergamo - Didattica e Rubrica >

# MACHINE LEARNING FOR ECONOMICS

Modulo Generico
Codice dell'attività formativa:
149009-E2

## Scheda dell'insegnamento

Per studenti immatricolati al 1° anno a.a.:
2022/2023
Insegnamento (nome in italiano):
MACHINE LEARNING FOR ECONOMICS
Insegnamento (nome in inglese):
MACHINE LEARNING FOR ECONOMICS
Insegnamento:
Tipo di attività formativa:
Attività formativa Caratterizzante
Tipo di insegnamento:
Obbligatoria
Settore disciplinare:
STATISTICA (SECS-S/01)
Anno di corso:
1
2022/2023
Crediti:
6
Responsabile della didattica:
Mutuazioni
• Corso di studi in ECONOMICS AND DATA ANALYSIS - Percorso formativo in PERCORSO COMUNE

## Altre informazioni sull'insegnamento

Ciclo:
Secondo Semestre
Obbligo di frequenza:
No
Ore di attività frontale:
48
Ore di studio individuale:
102
Ambito:
Statistico-matematico
Testi e materiali didattici:

#### Course Syllabus

Prerequisites

- Good knowledge of the fundamentals of Statistics (i.e. probability, inferential statistics, linear regression model).
- Basic knowledge of the R programming language (as given by the "Coding for data science" module 1).

Educational goals

The course aims at providing the knowledge of cutting-edge statistical tools for modeling complex data. In particular, the objective of the considered methods is the automatic detection of patterns in the data (i.e. to “learn” from data). The estimated models can then be used by the analysts to make accurate predictions and take decisions under uncertainty.

At the end of the course the student will gain the ability to:
a) choose and apply the appropriate statistical tool, in the class of statistical learning methods, for the analysis of different types of data coming from real-world problems;
b) use the open-source statistical software R (freely available for download at http://www.r-project.org) for performing data analysis and visualization, implementing statistical models and obtaining predictions;
c) interpret the results in a decision making perspective.

Course content

- Introduction to machine learning: supervised versus unsupervised learning, the bias-variance trade-off.
- Classification methods: K-nearest neighbors classification, logistic regression, naive Bayes, linear and quadratic discriminant analysis, classification trees (including bagging, random forests, boosting), support vector machine.
- Regression methods: K-nearest neighbors regression, ridge and lasso regression, non-linear regression models, regression trees (including bagging, random forests, boosting), support vector machine.
- Resampling methods: cross-validation and bootstrap.

Teaching methods

The course consists in theory lectures for a total of 48 hours. Extra hours (usually 12) are dedicated to R lab sessions. The lectures/labs calendar will be published at the beginning of the course on the Moodle page of the course.

Assessment and Evaluation

The exam consists in:
- a test including open-ended and T/F questions concerning theoretical topics or short applications of the studied methods;
- exercises to be solved using the R software in order to evaluate the ability of the student in analysing data and interpreting outputs.

The two parts of the exam (theoretical and practical) are each worth 50% of the total score, approximately.
This course represents the second module of the “CODING AND MACHINE LEARNING” course (12 cfu). The final score will be computed by averaging the grades obtained from the two modules (Coding for Data Science and Machine Learning for Economics). The final scores will be published in the e-learning page of the course.

Further information