EconPapers    
Economics at your fingertips  
 

Groundwater Origin Determination in Historic Chemical Datasets Through Supervised Compositional Data Analysis: Brines of the Permian Basin, USA

Mark A. Engle () and Julien A. Chaput ()
Additional contact information
Mark A. Engle: University of Texas at El Paso, Department of Geological Sciences
Julien A. Chaput: University of Texas at El Paso, Department of Geological Sciences

A chapter in Advances in Compositional Data Analysis, 2021, pp 265-283 from Springer

Abstract: Abstract Data from historic water quality databases often lack critical measurements necessary for focused investigations, such as determining the origin of the water. The U.S. Geological Survey produced waters database contains nearly 7,000 data of good quality for the Permian Basin, the single largest oil-producing province in the United States. However, fewer than 350 of those points contain enough geochemical data (Br concentration or δ18O and δ2H composition) to determine whether the origin of the samples is meteoric water or paleoseawater. Three supervised methods were applied to isometric and pairwise log-ratio transformed major ion data from a subset of samples of known origin but where the Br concentration and δ18O and δ2H composition were excluded to predict origin: linear discriminant analysis (isometric only), support vector machines (isometric and pairwise), and random forests (pairwise only). Error rates from validation, using data of known origin (excluding Br concentration and δ18O and δ2H composition) that were not used in model development, found that no method performed exceptionally well. An ensemble approach of only assigning classification when all three methods provide the same classification reduced the error rate of the validation data to 11% but failed to classify 28% of the data. This latter approach was applied to the nearly 7,000 samples which only contained concentrations of major ions (Cl, Ca, HCO3, Mg, Na, and SO4). Spatial mapping of these newly classified data generated insight on distribution and flow of meteoric and paleoseawater across the basin.

Keywords: Produced water; Discriminant analysis; Random forest; Support vector machines (search for similar items in EconPapers)
Date: 2021
References: Add references at CitEc
Citations:

There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:sprchp:978-3-030-71175-7_14

Ordering information: This item can be ordered from
http://www.springer.com/9783030711757

DOI: 10.1007/978-3-030-71175-7_14

Access Statistics for this chapter

More chapters in Springer Books from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2026-05-12
Handle: RePEc:spr:sprchp:978-3-030-71175-7_14