Optimization Study on Non-Invasive Prenatal Testing for Y Chromosome Target Concentration Using Box-Cox Transformation and K-Means Clustering
DOI:
https://doi.org/10.54097/dx71v673Keywords:
Non-Invasive Prenatal Testing (NIPT), Box-Cox Transformation, K-means Clustering.Abstract
Non-invasive prenatal testing (NIPT) faces challenges in clinical practice, including difficulties in achieving target Y chromosome concentrations in high-BMI pregnant women and the lack of individualized testing timepoints. This study addresses these two critical challenges by constructing a precise nonlinear regression model and developing a timing optimization strategy. To quantify the nonlinear effects of gestational age and BMI on Y chromosome concentration, data preprocessing was performed, including mean imputation for missing BMI values and linear interpolation for gestational age to ensure data stability and continuity. Pearson correlation analysis revealed a strong positive correlation between Y chromosome concentration and gestational age (r=0.72) and a strong negative correlation with BMI (r=−0.58), both identified as critical influencing factors. To capture nonlinear relationships, this study employed Box-Cox transformation combined with stepwise regression. The constructed nonlinear regression model achieved an R² value of 0.875, with both gestational age and BMI exhibiting extremely significant effects (p=0.000), enabling precise prediction of Y chromosome concentration. Building upon this predictive framework, to determine optimal testing timepoints for pregnant women with varying BMIs, K-means clustering (optimal k=5) was employed to categorize female fetus pregnancies into reasonable groups. Clustering results were validated via ANOVA (F=124.7, p<0.001), demonstrating significant differences and clinical interpretability. Subsequently, the optimal testing window with minimal risk was determined using a piecewise risk function, constrained to ensure a pass rate ≥90%. Results indicate that the optimal timing is delayed with increasing BMI, enabling personalized timing recommendations for high-BMI pregnant women.
Downloads
References
[1] Veuskens J R B, Rossum V M, Cattenstart E, et al. Common haplotypes within the chromosome 1q31.3 region determine systemic concentrations of the entire complement factor H protein family[J]. Journal of innate immunity, 2025, 21-26.
[2] Kumar A, Kumar S. The Role of X and Y Chromosomes in Semen Morphology and Concentration: A Study in Saran, Bihar, India[J]. Journal of Advances in Biology & Biotechnology, 2025, 28(3): 515-523.
[3] Xiao W, Akao S, Okamoto R, et al. The formation of aggregated chromatin/chromosomes in mouse oocytes treated with high concentration of IBMX as a model for a chromosome transfer in human[J]. Systems biology in reproductive medicine, 2024, 70(1): 195-203.
[4] Soberanis C F, Simpson L E, Beckett J A, et al. Near millimolar concentration of nucleosomes in mitotic chromosomes from late prometaphase into anaphase[J]. The Journal of cell biology, 2024, 223(11).
[5] Çift A, Benlioğlu C, Yücel Ö M, et al. A New Sperm Concentration Threshold for Y Chromosome Microdeletion Analysis in Infertile Men: Could It Be Azoopermia?[J]. Urology research & practice, 2024, 50(3): 181-186.
[6] Delinassios G J, Hoffman M R, Koumakis G, et al. Sub-toxic cisplatin concentrations induce extensive chromosomal, nuclear and nucleolar abnormalities associated with high malignancy before acquired resistance develops: Implications for clinical caution[J]. PloS one, 2024, 19(12): e0311976.
[7] Dwi R, Sofiati P, Agesti V S, et al. Preliminary study of chromosome aberrations using Giemsa, two-colour fish, and micronucleus assays in lymphocytes of individuals living in elevated radon concentration areas[J]. Radiation protection dosimetry, 2023, 199(14): 1508-1515.
[8] Feng Y, C F E S, Hao L, et al. Antifungal Tolerance and Resistance Emerge at Distinct Drug Concentrations and Rely upon Different Aneuploid Chromosomes[J]. mBio, 2023, 14(2): e0022723-e0022723.
[9] Jung S L, Akhil K, Z K Y, et al. Concentration of non-myocyte proteins in arterial media of cerebral autosomal dominant arteriopathy with subcortical infarcts and leukoencephalopathy[J]. PloS one, 2023, 18(2): e0281094-e0281094.
[10] Son T T, Ngoc N N, Hien L T T, et al. Screening Y Chromosome Microdeletion in 1121 Men with Low Sperm Concentration and the Outcomes of Microdissection Testicular Sperm Extraction (mTESE) for Sperm Retrieval from Azoospermic Patients[J]. The application of clinical genetics, 2023, 16: 155-164.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Highlights in Science, Engineering and Technology

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







