Alternative Loss Function in Evaluation of Transformer Models

Micha\'nk\'ow, Jakub; Sakowski, Pawe{\l}; \'Slepaczuk, Robert

Alternative Loss Function in Evaluation of Transformer Models

Jakub Micha\'nk\'ow, Pawe{\l} Sakowski and Robert \'Slepaczuk

Abstract: The proper design and architecture of testing machine learning models, especially in their application to quantitative finance problems, is crucial. The most important aspect of this process is selecting an adequate loss function for training, validation, estimation purposes, and hyperparameter tuning. Therefore, in this research, through empirical experiments on equity and cryptocurrency assets, we apply the Mean Absolute Directional Loss (MADL) function, which is more adequate for optimizing forecast-generating models used in algorithmic investment strategies. The MADL function results are compared between Transformer and LSTM models, and we show that in almost every case, Transformer results are significantly better than those obtained with LSTM.

Date: 2025-07, Revised 2025-07
New Economics Papers: this item is included in nep-cmp and nep-for
References: View complete reference list from CitEc
Citations:

Downloads: (external link)
http://arxiv.org/pdf/2507.16548 Latest version (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:arx:papers:2507.16548

Access Statistics for this paper

More papers in Papers from arXiv.org
Bibliographic data for series maintained by arXiv administrators ().