OSDF project

UConn_Wang

PI: HaiYing Wang · University of Connecticut

Mathematics and statistics

As the size of data explodes during the big data era, we develop a strategy to select more informative data points for building models to alleviate the computation burden. In contrast to previous studies on parametric models, our research explores the efficacy of optimal subsampling methods in gradient boosting trees, a semi-parametric method.

1.9 TB

Data delivered over the OSDF

4,000

Jobs

4.7K

Files via OSDF

18.3K

CPU hours

0

GPU hours

Cumulative usage · Jul 2, 2025 – Jul 2, 2026

Get involved

Bring your data onto the fabric.

Request an access point and connect your first repository in an afternoon — facilitation is free.