Graduate Programs Portal (UFABC)

SIGAA - Integrated System for Academic Activities Management (Sistema Integrado de Gestão de Atividades Acadêmicas)

PPGCCM GRADUATE PROGRAM IN COMPUTER SCIENCE
FUNDAÇÃO UNIVERSIDADE FEDERAL DO ABC
Phone/Extension: 11 4996-8337
E-mail: poscomp@ufabc.edu.br
http://propg.ufabc.edu.br/ppgccm

QUALIFYING EXAM Committee: LUIS CESAR DE AZEVEDO

A MASTER'S QUALIFYING EXAM committee has been registered by the program.
STUDENT: LUIS CESAR DE AZEVEDO
DATE: 21/06/2021
TIME: 14:00
LOCATION: remote participation
TITLE:

Quantifying the Bias-Variance decomposition in property predictions on Materials Science

PAGES: 70
MAJOR FIELD: Exact and Earth Sciences
FIELD: Computer Science
ABSTRACT:

Most machine learning (ML) applications in quantum-chemistry datasets rely heavily
on a single statistical error parameter such as the mean square error (MSE) to evaluate
their success or failure. However, this approach has limitations or can even yield incorrect
interpretations. Here, we report a systematic investigation of the two components of
the MSE, i.e., the bias and variance, using the quantum-chemistry QM9 dataset as an
example. To do that, we experiment with three state-of-the-art descriptors, namely (i)
Symmetry Functions (SF, with two-body and three-body functions), (ii) Many-body Tensor
Representation (MBTR, with two- and three-body terms), and (iii) Smooth Overlap of
Atomic Positions (SOAP), to evaluate the prediction process’s performance using different
numbers of QM9 molecules in training samples and the effect of bias and variance on
the final MSE. Overall, small training sets are associated with higher MSE. Moreover, the
bias component strongly influences the larger MSEs. Furthermore, there is little agreement
among descriptors on which molecules have the highest errors (outliers). Based on the
distribution of the MSE (and its bias and variance components) and the appearance of
outliers, we suggest using ensembles of low-bias models (in the case of QM9, the best
combination uses two versions of MBTR) to minimize the MSE, particularly when the
training set contains a small number of molecules.
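
The split of the MSE into bias and variance described in the abstract can be illustrated with a minimal sketch. It is not the procedure used in the dissertation: it assumes a synthetic one-dimensional regression problem instead of QM9, polynomial fits instead of the SF, MBTR, or SOAP descriptors, and bootstrap resampling of the training set to obtain repeated predictions. The names bias_variance_decomposition and make_poly_model are hypothetical. Observation noise is ignored because the reference values are noise-free, so for each test point the MSE equals the squared bias plus the variance.

```python
import numpy as np


def bias_variance_decomposition(fit_predict, x_train, y_train, x_test, y_test,
                                n_rounds=200, seed=0):
    """Estimate per-test-point MSE, squared bias, and variance via bootstrap."""
    rng = np.random.default_rng(seed)
    n = len(x_train)
    preds = np.empty((n_rounds, len(x_test)))
    for r in range(n_rounds):
        # Resample the training set with replacement and refit the model.
        idx = rng.integers(0, n, size=n)
        preds[r] = fit_predict(x_train[idx], y_train[idx], x_test)
    mean_pred = preds.mean(axis=0)
    bias_sq = (mean_pred - y_test) ** 2         # squared bias of the average prediction
    variance = preds.var(axis=0)                # spread of predictions across rounds
    mse = ((preds - y_test) ** 2).mean(axis=0)  # average squared error across rounds
    return mse, bias_sq, variance


def make_poly_model(degree):
    """Return a fit-and-predict function for a polynomial of the given degree."""
    def fit_predict(x_tr, y_tr, x_te):
        coeffs = np.polyfit(x_tr, y_tr, degree)
        return np.polyval(coeffs, x_te)
    return fit_predict


# Synthetic data (an assumption for illustration; not QM9).
rng = np.random.default_rng(42)
x_train = rng.uniform(-1.0, 1.0, size=40)
y_train = np.sin(3.0 * x_train) + rng.normal(0.0, 0.1, size=40)
x_test = np.linspace(-0.9, 0.9, 25)
y_test = np.sin(3.0 * x_test)  # noise-free reference values

for degree in (1, 5):
    mse, bias_sq, variance = bias_variance_decomposition(
        make_poly_model(degree), x_train, y_train, x_test, y_test)
    print(f"degree={degree}: MSE={mse.mean():.4f} "
          f"bias^2={bias_sq.mean():.4f} variance={variance.mean():.4f}")
```

Comparing the printed components for the two polynomial degrees shows which term dominates the error of each model, which is the kind of comparison the abstract makes across descriptors and training-set sizes.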

COMMITTEE MEMBERS:
Chair - Internal to the Program - 1673092 - RONALDO CRISTIANO PRATI
Full Member - Examiner Internal to the Program - 3008017 - DENIS GUSTAVO FANTINATO
Full Member - Examiner Internal to the Program - 1932365 - FABRICIO OLIVETTI DE FRANCA
Alternate Member - Examiner Internal to the Program - 2376122 - THIAGO FERREIRA COVOES

Notice registered on: 19/05/2021 23:05