Pseudoproxy modelling of uncertainties in palaeoecological data.

Link to slides

Quinn Asena, George Perry, Janet Wilmshurst

University of Auckland, UW-Madison

12/8/22

The problem

Proxy data are the product of multiple sources of uncertainty

  • Environmental processes

    • bioturbation, taphonomy, variable sedimentation rates…
  • Field and laboratory methods

    • core collection methods, sub-sampling strategy, pollen counting…
  • Data processing methods

    • age-depth modelling, interpolation…

The question: is the past recoverable from the data?

Why it matters

  • Palaeoecology moving from descriptive to quantitative

  • Palaeoecology to inform the future requires robust statistical approaches

  • Advances in lab methods, data availability, and statistics are making more inferences possible

What we can do about it

  • One method to assess uncertainties is in simulation

Approach and take-home

  • Simulate core samples containing proxies that mimic the statistical properties of empirical data

  • Simulate uncertainties: process and observer error that affect the data

  • Assess how statistical inferences are affected by those uncertainties

Key concepts

Virtual ecology

Virtual ecology is a framework for assessing sampling and analytical methods in simulation consisting of:

  1. an ecological model that generates synthetic data

    1a. a degradation model

  2. a simulated observational process (a sampling model) that samples the synthetic data

  3. an analytical process or statistical model applied to both sets of data

  4. an assessment of the results

Virtual ecology and empirical ecology

Perfect knowledege, imperfect world

  • known drivers and responses

  • known environmental and observational processes

  • Advantage of benchmark/control data

  • Advantage of replication

  • Able to systematically introduce uncertainty

Perfect world, imperfect knowledge

  • Sampled data with no benchmark/control

  • Advantage of being reality

Proxy system modelling

Describes the process by which environmental change is recorded as an observable signal in an archive:

  1. environmental drivers (e.g., climatic variability)

  2. a sensor (a physical, biological or chemical component of the system that responds to the environmental drivers)

  3. an archive (the medium in which the response of the sensor is recorded such as a lake sediment)

  4. observations drawn from the archive

Proxy Ststem Modelling framework, from Evans et al. (2013)

Pseudoproxy experiments

Borrowing the term “pseudoproxies” from climatology:

  • pseudoproxies are simulated data or modified observational data

  • mimic the statistical properties of empirical data

  • pseudoproxy experiments are similar to virtual ecology

Building the model

Simulating pseudoproxies

We set out to:

  • Represent multiple interacting drivers

  • Include underlying ecological dynamics that can undergo community turnover

  • Generate a multi-species pseudoproxy

  • Recreate core formation processes of accumulation rates and time-span

  • Virtually recreate the observational processes

Simulating pseudoproxies

Partial output of 20 randomly selected species

Ruining pseudoproxies

A. Example species from single replicate

B. Mixing

C. Mixing + sub-sampling

D. Mixing + sub-sampling + proxy counting

Ruined pseudoproxies

Applying the model

Analyses

Ok, now we have generated the data, let’s analyse it. Two analyses:

  • Fisher Information

  • Principal curves

Demonstrating two scenarios with different driving environments.

Scenario 1

Scenario 2

Assessing uncertainties across replicates

Each replicate results in 1210 datasets from the ‘error-free’ to the most uncertain, per scenario 😱.

Across replicates for each of the 1210 datasets:

  1. extract features from the FI and PrC

    • feature analysis reduces the FI and PrC to one dimension
  2. calculate the distance between each dataset from the ‘error-free’ to the most uncertain

  3. make cool visulisations!

Assessing uncertainties across replicates

Quantify the difference between the ‘error-free’ archive and each level of uncertainty. The following is an example from scenario 1 using Fisher Information.

Assessing uncertainties across replicates

Application to empirical

Simulation methods can be integrated with empirical studies to:

  • a priori help shape field sampling methods: e.g., number of core samples (across a region or local replication) required for a given research question.

  • understand the sub-sampling and count resolution required to increase the likelihood of detecting a hypothesised signal in the data.

  • accompany empirical study to test hypotheses about the underlying dynamics that may cause an observed pattern in the data.

  • assess whether inferences made from the data are robust to uncertainty.

What I haven’t covered

“All models are wrong, some are useful” Box (1979)

  • Assessing error rates

  • Chronological uncertainty

  • Extend underlying dynamics to ask specific questions (e.g., resilience)

Acknowledgements

  • George Perry (University of Auckland) and Janet Wilmshurs (University of Auckland, and Manaaki Whenua – Landcare Research)

  • Jack Williams and Tony Ives (University of Wisconsin Madison)

  • Biological heritage Science Challenge (NZ) and the National Science Foundation (USA)

References

Blaauw, Maarten, K. D. Bennett, and J. Andrés Christen. 2010. “Random Walk Simulations of Fossil Proxy Data.” The Holocene 20 (4): 645–49. https://doi.org/10.1177/0959683609355180.
Box, G. E. P. 1979. “Robustness in the Strategy of Scientific Model Building.” In Robustness in Statistics, 201–36. Elsevier. https://doi.org/10.1016/B978-0-12-438150-6.50018-2.
Evans, M. N., S. E. Tolwinski-Ward, D. M. Thompson, and K. J. Anchukaitis. 2013. “Applications of Proxy System Modeling in High Resolution Paleoclimatology.” Quaternary Science Reviews 76 (September): 16–28. https://doi.org/10.1016/j.quascirev.2013.05.024.
Mann, Michael E., and Scott Rutherford. 2002. “Climate Reconstruction Using Pseudoproxies.” Geophysical Research Letters 29 (10): 139-1-139-4. https://doi.org/10.1029/2001GL014554.
Williams, John W., Jessica L. Blois, and Bryan N. Shuman. 2011. “Extrinsic and Intrinsic Forcing of Abrupt Ecological Change: Case Studies from the Late Quaternary.” Journal of Ecology 99 (3): 664–77. https://doi.org/10.1111/j.1365-2745.2011.01810.x.
Zurell, Damaris, Uta Berger, Juliano S. Cabral, Florian Jeltsch, Christine N. Meynard, Tamara Münkemüller, Nana Nehrbass, et al. 2010. “The Virtual Ecologist Approach: Simulating Data and Observers.” Oikos 119 (4): 622–35. https://doi.org/10.1111/j.1600-0706.2009.18284.x.