Data Science in finance and other applications
|Kod Erasmus / ISCED:||
|Nazwa przedmiotu:||Data Science in finance and other applications|
|Jednostka:||Wydział Nauk Ekonomicznych|
Seminaria magisterskie dla II roku programów anglojęzycznych
|Punkty ECTS i inne:||
Seminar is aimed at students interested in conducting advanced empirical research with the use of modern data science and machine learning methods in financial or any other applications..
1. Rules of writing the master's thesis
2. Alternative tools for writing the thesis – LaTeX and RMarkdown
3. Review of available machine learning data databases
4. Model cross-validation techniques
5. Predictive modelling in classification and regression tasks
6. Discussion of various machine learning algorithms: k-Nearest Neighbours, Support Vector Machines / Support Vector Regression, ridge and LASSO, decision and regression trees, bagging, random forests, boosting trees (incl. xgboost), neural networks, model ensembling and stacking, deep learning
7. Feature engineering and feature selection
8. Discussion of various financial applications
9. Discussion of various non-financial applications
10. Students' research topics, research hypotheses
11. Presentation of students' research concepts
Seminar is aimed at students with good understanding of traditional regression models (linear and logistic) and at least intermediate programming experience (preferred R, but python also welcome) interested in conducting advanced empirical research on up-to-date topics with the use of modern data science and machine learning methods in financial or any other applications.
The lecturer offers help in further development of analytical (machine learning) and programming skills, awaited by future employers. Potential joint article in scientific economic journal based on the thesis are also possible.
The seminar will begin with the discussion of the rules of writing the master's thesis, its structure and composition. Alternative tools for writing the thesis, but also articles and effective presentations (LaTeX and RMarkdown) will be presented with practical examples. Students will get to know several available machine learning data databases with real (big) data ready for use in the thesis. Model cross-validation techniques and model quality assessment measures will be discussed. Various machine learning algorithms and their application to predictive modelling in classification and regression tasks will be presented and explained, including the k-Nearest Neighbours, Support Vector Machines / Support Vector Regression, elastic nets – ridge and LASSO regression, decision and regression trees, bootstrap averaging of models, random forests, different algorithms of boosting trees (incl. xgboost), several types of neural networks, model ensembling and stacking and deep learning. And last, but not least, additional topics related to feature engineering, feature selection and resampling methods will be mentioned as well. Students will present and discuss various financial and non-financial applications of predictive modelling based on real data and previous research. Seminar participants will also present their research concepts – selected topic, research framework, hypotheses, results of empirical analyses and the discussion of conclusions
Gareth James, Daniela Witten, Trevor Hastie and Robert Tibshirani (2017), “Introduction to statistical learning. With Applications in R”, Springer-Verlag.
Hastie Trevor, Robert Tibshirani and Jerome Friedman (2009), “Elements of statistical learning”, Springer-Verlag.
Kuhn Max, Johnson Kjell (2013), “Applied predictive modelling”, Springer-Verlag.
Wickham Hadley, Grolemund Garrett (2017), “R for Data Science”, O'Reilly Media.
|Efekty uczenia się:||
Master thesis using selected machine learning algorithms
KW01, KW02, KW03, KU01, KU02, KU03, KK01, KK02, KK03
|Metody i kryteria oceniania:||
Students’ presentations during seminar meetings, activity in discussions on the presentations of other seminar participants, progress with the master thesis.
Zajęcia w cyklu "Semestr zimowy 2022/23" (w trakcie)
Zajęcia w cyklu "Semestr letni 2022/23" (jeszcze nie rozpoczęty)
Właścicielem praw autorskich jest Uniwersytet Warszawski, Wydział Nauk Ekonomicznych.