| Description |
A multi-stage ensembled machine learning model was developed to estimate metal abundances in seafloor massive sulfide deposits worldwide. The modeling framework integrates (1) KMeans++ clustering to identify geochemical groupings based on enrichment controls, (2) Random Forest classification to assign geochemical labels to vent fields with incomplete or absent geochemical data, and (3) XGBoost regression to generate high-fidelity predictions of metal concentrations. This USGS model application data release includes all scripts, input files, and output files necessary to apply the model to estimate concentrations of cobalt, gold, and zinc. This model is not limited by spatial boundaries and is intended for application to any oceanic location with appropriate input data. [More]
|