Purdue University Graduate School
OPTIMAL_SUBSAMPLING_FOR_MASSIVE_PSIM.pdf (1.77 MB)

OPTIMAL SUBSAMPLING FOR MASSIVE PENALIZED SPLINE SINGLE INDEX MODELS

thesis
posted on 2019-10-16, 17:02 authored by Haixia Smithson
The semiparametric single index model is a well-known compromise between parametric and nonparametric regression: the mean of the response depends on a linear combination of the covariates through an unknown univariate function. The model has been widely studied for its simplicity and flexibility, yet applying it remains challenging, especially for large datasets. This thesis focuses on a subsampling approach to fitting semiparametric single index models on large datasets, where fitting the full data can be computationally difficult because of long computing times and high memory requirements. Under subsampling, the estimator computed on a subsample, called the subsampling estimator, is used to approximate the estimator computed on the full sample, called the full sample estimator. To obtain the optimal sampling probabilities, i.e., the optimal subsampling method, we first study the asymptotic properties of the subsampling estimator in a general semiparametric single index model under a general subsampling method, and then derive the formula for the optimal sampling probabilities by minimizing the asymptotic MSE of the subsampling estimator. We consider specific models in simulation studies and real data applications to investigate the numerical performance of the optimal subsampling method.
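
The idea of approximating a full sample estimator with a weighted subsampling estimator can be illustrated with a minimal Python sketch. It rests on several assumptions that are not taken from the thesis: an ordinary linear model stands in for the penalized spline single index model, the function names (`full_estimator`, `subsample_estimator`) are hypothetical, and the non-uniform sampling probabilities are a simple pilot-based heuristic rather than the optimal probabilities derived in the thesis.

```python
import numpy as np


def full_estimator(X, y):
    """Least-squares fit on the full sample (stand-in for the full sample estimator)."""
    return np.linalg.lstsq(X, y, rcond=None)[0]


def subsample_estimator(X, y, probs, r, rng):
    """Draw r rows with replacement using `probs`, then fit a weighted least squares.

    Each sampled row is weighted by 1 / (n * p_i) so that the weighted
    objective is an unbiased estimate of the full-sample objective.
    """
    n = X.shape[0]
    idx = rng.choice(n, size=r, replace=True, p=probs)
    sqrt_w = np.sqrt(1.0 / (n * probs[idx]))
    return np.linalg.lstsq(X[idx] * sqrt_w[:, None], y[idx] * sqrt_w, rcond=None)[0]


rng = np.random.default_rng(0)
n, d, r = 20000, 3, 1000

# Synthetic data from a linear model (an assumption made for illustration only).
X = rng.normal(size=(n, d))
beta_true = np.array([1.0, -2.0, 0.5])
y = X @ beta_true + rng.normal(scale=0.5, size=n)

beta_full = full_estimator(X, y)

# Uniform subsampling: every row has probability 1/n.
uniform = np.full(n, 1.0 / n)
beta_unif = subsample_estimator(X, y, uniform, r, rng)

# Illustrative non-uniform probabilities (NOT the thesis's formula):
# proportional to |pilot residual| * ||x_i||, mixed with the uniform
# distribution so that no probability is zero and weights stay bounded.
pilot = subsample_estimator(X, y, uniform, 200, rng)
score = np.abs(y - X @ pilot) * np.linalg.norm(X, axis=1)
nonuniform = 0.9 * score / score.sum() + 0.1 * uniform
beta_nonunif = subsample_estimator(X, y, nonuniform, r, rng)

print("uniform error:    ", np.linalg.norm(beta_unif - beta_full))
print("non-uniform error:", np.linalg.norm(beta_nonunif - beta_full))
```

Both subsampling estimators use only `r` of the `n` rows, so the fit is cheaper than on the full sample; the printed distances to `beta_full` indicate how well each sampling scheme approximates the full sample estimator in this toy setting.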

History

Degree Type

  • Doctor of Philosophy

Department

  • Mathematics

Campus location

  • West Lafayette

Advisor/Supervisor/Committee Chair

Fang Li

Advisor/Supervisor/Committee co-chair

Hanxiang Peng

Additional Committee Member 2

Peijun Li

Additional Committee Member 3

Zuofeng Shang

Additional Committee Member 4

Wanzhu Tu
