Research Article |
|
Corresponding author: Leonardo Salvatori ( leonardo.salvatori@unicam.it ) Academic editor: Irena Axmanová
© 2026 Marco Cervellini, Luciano Ludovico Maria De Benedictis, Leonardo Salvatori, Stefano Chelli, Giandiego Campetella, Federico Selvi, Giovanni Iacopetti, Arianna Ferrara, Alessandro Chiarucci, Antonella Canini, Maura Francioni, Chiara Scalet, Roberto Canullo.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Cervellini M, De Benedictis LLM, Salvatori L, Chelli S, Campetella G, Selvi F, Iacopetti G, Ferrara A, Chiarucci A, Canini A, Francioni M, Scalet C, Canullo R (2026) Training and intercalibration reduce observer-induced variability in forest vegetation surveys. Vegetation Ecology and Diversity 63: e189819. https://doi.org/10.3897/ved.189819
|
The increasing availability of vegetation-plot data results from decades of field surveys conducted by numerous observers. This highlights observer error not as a marginal issue but as a structural component of uncertainty, with direct consequences for estimates of species turnover and temporal trends. Observer-related sampling bias can be investigated through variation in species richness, species composition and abundance. Specifically, pseudoturnover refers to changes in species composition caused by overlooking or misidentification of taxa between sampling events or among different observers surveying the same plot. Although the causes and implications of observer error have been widely discussed, it remains unclear whether observer-related pseudoturnover decreases within observer groups as a result of training. Using data from training and intercalibration sessions carried out in 2023 and 2025 within two forest monitoring programmes in Italy (the LIFE project ModerNEC and the Italian National Forest Inventory), we assessed whether targeted training and collective briefing reduce observer-induced pseudoturnover. We applied Bayesian multilevel models to estimate changes in inter-observer species richness variability and inter-observer dissimilarity. The former decreased across observers in both years, while the latter declined after training when using Jaccard and Euclidean distances in both years; Bray–Curtis dissimilarity decreased only in 2023 and increased in 2025. Overall, training and intercalibration are likely to reduce observer-induced pseudoturnover related to species presence, while variability in abundance estimation needs further study and remains a key challenge for future vegetation monitoring programmes.
Intercalibration, observer error, pseudoturnover, training, vegetation resurvey
There is a large and increasing availability of vegetation plot data, particularly in Europe, covering wide spatial extents as well as long-term temporal gradients, largely resulting from decades of field surveys conducted by hundreds of surveyors. These data are fundamental to understand vegetation patterns and the drivers shaping them, including global changes (
One way to quantify observer error is to compare observations with the true value of the variable of interest. However, in vegetation studies, true values are rarely known because a complete and exhaustive list of species in a given area is often unavailable (
In this context,
Forest ecosystems represent one of the most frequently studied habitats in observer error research (
Using data collected during training and intercalibration sessions carried out within these programmes in Italy, we aim to assess whether targeted training and collective briefing can reduce observer-induced pseudoturnover in forest vegetation surveys. Specifically, we estimate whether inter-observer variability decreases between pre- and post-training exercises in terms of species richness and species composition, the latter measured by three dissimilarity indices.
In 2023, within Action B4 “Training for monitoring operators” of the LIFE project ModerNEC (https://lifemodernec.eu/), four teams, each composed of two botanists, surveyed four 10 m × 10 m forest plots. Observers recorded species presence and percentage cover for herbaceous, shrub and tree layers. The same approach was adopted in 2025 within the action “Training and intercalibration for plant diversity surveyors” of the IFNI 2025 project (
In 2023, the project protocol prescribed the use of the 7-degree Braun-Blanquet cover-abundance scale (
Training and intercalibration activities were structured over three consecutive days and included two survey exercises with the following data entry phase, and a collective briefing session between the surveys. In 2025, a list of species occurring in an area that includes all the plots, produced by a control team, was supplied to the observers. While this was not performed in 2023, control team and observers still assembled a field-based consensus list before the first survey exercise. For both projects, observers were selected on a geographical basis (i.e., Italian administrative regions) and playing with a common species list is a tool to simulate their familiarity with the real field flora. Prior to the surveys, an on-field operations manual was distributed to all observers. The manual described the training survey protocol in detail, outlining the sequence of actions and specific procedures required for plant diversity surveys. The first survey exercise (“pre-training”, EX1) was conducted on the first day, with the observers cycling through the plots. On the second day, data entry was followed by a briefing session, during which observers discussed difficulties and self-perceived errors, including taxonomic issues. The second survey exercise (“post-training”, EX2) consisted of re-surveying the same plots and was carried out on the third day. During the briefing session, observers participated in a “species familiarization” exercise (with plant specimens collected outside the plots) aimed at identifying taxa considered critical for correct identification (
We used Bayesian multilevel models fitted through the ‘brms’ package (
Since the two years represent different training experiences, different structure of the teams, and different vegetation, they are analysed separately.
All analyses and data handling have been performed with R, version 4.5.3 (
In order to measure the effect of training on observer-induced variability in the estimation of species richness, we assessed whether the standard deviation between observers decreased after training. Species richness was modeled as a function of the population-level effects of exercise (EX1 and EX2) and plots (SU), including a group-level effect for observers estimated separately for each exercise. Since the focus was on observer variation, not reliable inference about plot-level richness, SU was chosen as a population-level effect reflecting the differences in species richness among plots. Weakly informative priors were used, together with prior predictive simulations, to constrain outcomes and effects within known bounds (
Since most of the plant biodiversity in temperate forest ecosystems is found in the herbaceous layer (
Dissimilarity was estimated as a function of a population-level effect of exercise and group-level effects of SU and observer, allowing systematic differences in dissimilarity among SUs and observers. Since the dissimilarities take values bounded between 0 and 1, with values near the bounds likely, the beta likelihood was used, with the logit link for the mean and the identity link for the sample size parameter. The beta distribution does not allow exact 0s and 1s, so we subtracted 10-6 from the 1s resulting from scaling the Euclidean distance into this range.
Regression coefficients from generalised linear models cannot usually be interpreted directly, representing a non-linear change in the outcome through the link function (
Species richness showed no consistent change between EX1 and EX2 in 2023 (0.92 species, 95% posterior quantile interval: -4.42–5.79) and 2025 (-1.65 species, 95% posterior quantile interval: -2.87 – -0.41), while the precision of species richness estimates increased across observers in both years. Inter-observer variability was higher before training, as indicated by the posterior distribution of the EX2 - EX1 contrast (Figure
Dissimilarity analyses revealed a general reduction in inter-observer dissimilarity after training when using Jaccard index and Euclidean distance in both years. In contrast, Bray–Curtis dissimilarity decreased only in 2023, whereas there is evidence of an increase in 2025 (Figure
Training and intercalibration substantially reduced inter-observer variability in species richness and dissimilarity in the 2023 and 2025 exercises. In our case study, the briefing session seems to have produced a similar effect on variability reduction in measuring species richness in both 2023 (4 teams with 2 observers each) and 2025 (16 individual observers) exercises.
The reduction in Jaccard-based dissimilarity confirms the briefing session enhanced agreement in species detection, highlighting its effectiveness in reducing overlooking and misidentification errors. Importantly, a similar reduction in variability was observed in 2023, even without a reference species list, indicating that the interactive briefing session per se was sufficient to promote species familiarization and observer alignment (
In contrast, the absence of a consistent reduction in Bray–Curtis dissimilarity indicates that individual differences in abundance estimation persisted despite training, particularly in the 2025 campaign, which involved a larger number of single observers. This pattern was observed even though the 2025 protocol prescribed the use of direct percentage cover estimates, generally considered more robust than Braun-Blanquet classes (
The observed decrease in Euclidean dissimilarity further supports a general convergence among observers, especially in terms of shared detections and non-detections. It has to be stressed again that this distance considers “double zeros” in the same way as “double presences”. Usually, this feature does not allow the use of this index for community comparisons (Orlóci paradox, see
Overall, these results demonstrate that training and intercalibration are effective in reducing observer-induced pseudoturnover related to species presence, while variability in abundance estimation remains an important challenge for future vegetation monitoring programmes. The approach presented in this study addresses the needs of quality assurance for monitoring in forest ecosystems and complies with the requirements and criteria of the ICP Forests Monitoring Programme (
Focusing on key elements of vegetation surveys, we provide some general advice. In programmes covering a wide geographic extent, project planning would assign observers on a geographical basis (e.g., Italian administrative regions). Playing with a common species list is a tool to simulate familiarity with the flora, which is otherwise specific to the survey exercises, providing a baseline to assess remaining biases in terms of species identification and overlooking. Often, the survey protocol requires subdividing the vegetation into vertical layers; this is a potential cause of large discrepancies in cover estimation, therefore the training should focus on correct application of the protocol and all possible edge cases or misinterpretations that could occur. Sampling units differ in both size and structure (e.g., nested or transect) depending on the ecosystem considered, whether it is characterized by denser vegetation with few dominant species (e.g., grassland) or by sparsely distributed species (e.g., shrubland); the training phase should reflect as much as possible all those sampling conditions.
Our results provide evidence that targeted training and intercalibration are effective tools for reducing observer-induced pseudoturnover related to species presence in vegetation surveys, even when observer heterogeneity increases.
Monitoring programmes and resurvey projects should include training and intercalibration exercises as a way to establish a common ground not only in terms of a standard procedure but also of its interpretation, guaranteeing the achievement of programmes’ goals and the expected relative team performances. This is in line with Morrison et al. (
Briefing sessions are a group exercise where observers share their survey experiences allowing them to avoid systematic errors (e.g., protocol misinterpretations) and harmonize judgement (despite personal professional background and knowledge). They consistently improve agreement in species detection, by reducing overlooking and misidentification errors and enhancing the reliability of species richness and presence-absence metrics. In contrast, observer variability in species cover estimation remains largely unresolved, despite the use of direct percentage cover values, indicating that abundance assessment is inherently more subjective and sensitive to observer-specific attributes. These findings highlight that standard training protocols are sufficient to improve detection-related data quality, but not to fully control variability in abundance estimates. Future large-scale vegetation monitoring programmes should therefore complement training with dedicated calibration tools for cover estimation to ensure robust assessments of vegetation change. In the absence of training on plant species cover, we suggest using binary presence/absence data for scientific purposes.
The authors thank the “Follonica Carabinieri Biodiversity Department” (https://rgpbio.it/reparto/follonica/) staff assigned to the “Marsiliana” State Nature Reserve and all the botanists who participated in the training and intercalibration exercises (LIFE project ModerNEC on-field manual https://lifemodernec.eu/documenti/Raccolta_manuali_operazio_i_di_campagna_Rete_NEC_Italia.pdf; IFNI 2025 project on-field manual https://zenodo.org/records/17022354).
Conflict of interest
The authors have declared that no competing interests exist.
Ethical statement
No ethical statement was reported.
Artificial Intelligence (AI) use
Regarding the use of AI in the preparation of this manuscript, the authors declare the following: English proofing for the first draft, then thoroughly revised.
Funding
The New Italian Forest Inventory Project (IFNI2025) was carried out with funding from the Carabinieri Command of Forestry, Environmental and Agri-Food Unit (CUFAA).
Author contributions
Marco Cervellini: Investigation, Conceptualization, Data curation, Software, Visualization, Methodology, Writing – original draft, Writing – review and editing; Luciano Ludovico Maria De Benedictis: Conceptualization, Data curation, Software, Formal analysis, Visualization, Methodology, Writing – original draft, Writing – review and editing; Leonardo Salvatori: Investigation, Writing – review and editing; Stefano Chelli: Investigation, Writing – review and editing; Giandiego Campetella: Investigation, Supervision, Writing – review and editing; Federico Selvi: Investigation, Writing – review and editing; Giovanni Iacopetti: Investigation, Writing – review and editing; Arianna Ferrara: Writing – review and editing; Alessandro Chiarucci: Funding acquisition, Writing – review and editing; Antonella Canini: Funding acquisition, Writing – review and editing; Maura Francioni: Writing – review and editing; Chiara Scalet: Writing – review and editing; Roberto Canullo: Funding acquisition, Investigation, Supervision, Writing – review and editing.
Author ORCIDs
Marco Cervellini https://orcid.org/0000-0002-0853-2330
Luciano Ludovico Maria De Benedictis https://orcid.org/0009-0001-5014-4526
Leonardo Salvatori https://orcid.org/0009-0004-2522-9204
Stefano Chelli https://orcid.org/0000-0001-7184-8242
Giandiego Campetella https://orcid.org/0000-0001-6126-522X
Federico Selvi https://orcid.org/0000-0002-3820-125X
Giovanni Iacopetti https://orcid.org/0000-0002-1472-4435
Arianna Ferrara https://orcid.org/0000-0002-8178-3910
Alessandro Chiarucci https://orcid.org/0000-0003-1160-235X
Antonella Canini https://orcid.org/0000-0003-1132-8899
Maura Francioni https://orcid.org/0009-0005-6839-0548
Chiara Scalet https://orcid.org/0009-0001-3353-5795
Roberto Canullo https://orcid.org/0000-0002-9913-6981
Data availability
The data used in this study is available at https://doi.org/10.5281/zenodo.18710554. The code and notebook reporting the analyses are available at https://doi.org/10.5281/zenodo.19677911.
Examples of the graphical outputs used during the briefing session
Data type: pdf
Explanation note: figure S1: Plot displaying species richness for each observer and sampling unit; figure S2: Plot displaying the cover values for three species (minimum, median and maximum average cover) in a sampling unit; figure S3: Plot displaying herb layer cover for each observer and sampling unit.