Vis enkel innførsel

dc.contributor.authorJuric, Radmila
dc.contributor.authorDanilchanka, Natallia
dc.date.accessioned2021-04-06T12:24:13Z
dc.date.available2021-04-06T12:24:13Z
dc.date.created2021-03-10T12:19:27Z
dc.date.issued2020
dc.identifier.citationJuric, R., Danilchanka, N., & Mousavi, M. G. (2020, January). Reliability of Training Data Sets for ML Classifiers: a Lesson Learned from Mechanical Engineering. In T. X. Bui (Red.), Proceedings of the 53rd Hawaii International Conference on System Sciences (s. 891-900).en_US
dc.identifier.isbn978-0-9981331-3-3
dc.identifier.urihttps://hdl.handle.net/11250/2736402
dc.description.abstractThe popularity of learning and predictive technologies, across many problem domains, is unprecedented and it is often underpinned with the fact that we efficiently compute with vast amounts of data and data types, and thus should be able to resolve problems, which we could not in the past. This view is particularly common among scientists who believe that the excessive amount of data, we generate in real life, is ideal for performing predictions and training algorithms. However, the truth might be quite different. The paper illustrates the process of preparing a training data set for an ML classifier, which should predict certain conditions in mechanical engineering. It was not the case that it was difficult to define and choose classifiers, in order to secure safe predictions. It was our inability to create a safe, reliable and trustworthy training data set, from scientifically proven experiments, which created the problem. This places serious doubts on the way we use learning and predictive technologies today. It remains debatable what the next step should be. However, if in ML algorithms, and classifiers in particular, the semantic which is built-in data sets, influences classifier’s definition, it would be very difficult to evaluate and rely on them, before we understand data semantics fully. In other words, we still do not know how the semantic, sometimes hidden in a data set, can adversely affect algorithms trained by them.en_US
dc.language.isoengen_US
dc.relation.ispartofProceeding of the 53rd Hawaii International Conference on System Sciences (HICSS 2020)
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internasjonal*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/deed.no*
dc.titleReliability of Training Data Sets for ML Classifiers: A Lesson Learned from Mechanical Engineeringen_US
dc.typeChapteren_US
dc.description.versionpublishedVersionen_US
dc.source.pagenumber891-900en_US
dc.identifier.doihttps://doi.org/10.24251/HICSS.2020.111
dc.identifier.cristin1896950
cristin.ispublishedtrue
cristin.fulltextoriginal


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel

Attribution-NonCommercial-NoDerivatives 4.0 Internasjonal
Med mindre annet er angitt, så er denne innførselen lisensiert som Attribution-NonCommercial-NoDerivatives 4.0 Internasjonal