Building composite indices in the age of big data - Application to honey bee exposure to infectious and parasitic agents

大数据时代综合指数的构建——以蜜蜂接触传染性和寄生性病原体为例

阅读:1

Abstract

Pollinator insects play a crucial role in maintaining biodiversity and agricultural production worldwide. Yet they are subject to various infectious and parasitic agents (IPAs). To better assess their exposure to IPAs, discriminative and quantitative molecular methods have been developed. These tools produce large datasets that need to be summarised so as to be interpreted. In this paper, we described the calculation of three types of composite indices (numerical, ordinal, nominal) to characterize the honey bee exposure to IPAs in 128 European sites. Our summarizing methods are based on component-based factorial analyses. The indices summarised the dataset of eight IPAs quantified at two sampling times, into synthetic values providing different yet complementary information. Because our dataset included two sampling times, we used Multiple Factor Analysis (MFA) to synthetize the information. More precisely, the numerical and ordinal indices were generated from the first component of MFA, whereas the nominal index used the first main components of MFA combined with a clustering analysis (Hierarchical Clustering on components). The numerical index was easy to calculate and to be used in further statistical analyses. However, it contained only about 20% of the original information. Containing the same amount of original information, the ordinal index was much easier to interpret. These two indices summarised information in a unidimensional manner. Instead, the nominal index summarised information in a multidimensional manner, which retained much more information (94%). In the practical example, the three indices showed an antagonistic relationship between N. ceranae and DWV-B. These indices represented a toolbox where scientists could pick one composite index according to the aim pursued. Indices could be used in further statistical analyses but could also be used by policy makers and public instances to characterize a given sanitary situation at a site level for instance.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。