CURRENT CHALLENGES OF SYMBOLIC REGRESSION: OPTIMIZATION, SELECTION, MODEL SIMPLIFICATION, AND BENCHMARKING
Symbolic Regression (SR) is a regression method that aims to discover mathematical expressions
that describe the relationship between variables; it is most often implemented through Genetic
Programming, an evolutionary algorithm inspired by biological evolution. Its appeal lies in combining
predictive accuracy with interpretable models, but its promise is limited by several long-standing
challenges: model parameters are difficult to optimize, the choice of selection scheme can steer
the search, and models often grow unnecessarily complex. In addition, current methods must be constantly
re-evaluated to understand the SR landscape. This thesis addresses these challenges through a
sequence of studies conducted throughout the doctorate, each focusing on an important aspect of
the SR search process. First, I investigate parameter optimization, obtaining insights into its role in
improving predictive accuracy, albeit with trade-offs in runtime and expression size. Next, I study
parent selection, exploring ϵ-lexicase selection to choose parents more likely to generate well-performing
offspring. The focus then turns to simplification, where I introduce a novel method based on
memoization and locality-sensitive hashing that reduces redundancy and yields simpler, more
accurate models. All of these contributions are implemented in a multi-objective evolutionary
SR library, which achieves Pareto-optimal trade-offs between accuracy and simplicity
on benchmarks of real-world and synthetic problems, outperforming several contemporary
SR approaches. The thesis concludes by reimagining a well-known large-scale symbolic regression
benchmark suite to reassess the SR landscape, demonstrating that the proposed method
achieves Pareto-optimal performance.
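For readers unfamiliar with ϵ-lexicase selection, the following is a minimal sketch of the standard selection loop (filter candidates case by case, keeping those within ϵ of the best error on each randomly ordered training case). The function name and data layout here are illustrative and are not the API of the library developed in this thesis.

```python
import random

def epsilon_lexicase_select(population, errors, eps):
    """Select one parent via epsilon-lexicase selection.

    population: list of individuals
    errors: errors[i][c] = error of individual i on training case c
    eps: per-case epsilon thresholds (e.g. the median absolute
         deviation of errors on each case)
    """
    candidates = list(range(len(population)))
    cases = list(range(len(errors[0])))
    random.shuffle(cases)  # cases are considered in random order
    for c in cases:
        if len(candidates) == 1:
            break
        # keep only candidates within eps of the best error on this case
        best = min(errors[i][c] for i in candidates)
        candidates = [i for i in candidates if errors[i][c] <= best + eps[c]]
    # ties after all cases are broken uniformly at random
    return population[random.choice(candidates)]
```

With ϵ set to zero this reduces to plain lexicase selection; nonzero ϵ relaxes the per-case filter, which is what makes the method effective on continuous-valued regression errors.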