A MODEL FOR ANALYSING DATA PORTAL PERFORMANCE: THE BIODIVERSITY CASE
Keywords:
Data Portal Performance, Distributed Database Systems, Data Replication ProcessesAbstract
Currently, many museums, botanic gardens and herbariums keep data of biological collections and using computational tools researchers digitalize and provide access to their data using data portals. The replication of databases in portals can be accomplished through the use of protocols and data schema. However, the implementation of this solution demands a large amount of time, concerning both the transfer of fragments of data and processing data within the portal. With the growth of data digitalization in institutions, this scenario tends to be increasingly exacerbated, making it hard to maintain the records updated on the portals. As an original contribution, this research proposes analysing the data replication process to evaluate the performance of portals. The Inter-American Biodiversity Information Network (IABIN) biodiversity data portal of pollinators was used as a study case, which supports both situations: conventional data replication of records of specimen occurrences and interactions between them. With the results of this research, it is possible to simulate a situation before its implementation, thus predicting the performance of replication operations. Additionally, these results may contribute to future improvements to this process, in order to decrease the time required to make the data available in portals.
Downloads
References
Shanping, L., Jiaqi, T.,A collaborative performance tuning approach for Portal-based web sites.
In: Sixth International Conference on Networked Computing and Advanced Information
Management (NCM), 2010, pp.113-117.
Nicola, M. and Jarke, M. 2000. Performance Modeling of Distributed and Replicated Databases.
In: IEEE Transactions on Knowledge and Data Engineering. 12, 4 (2000), 645-672.
Schneiders, A., Van Daele, T., Van Landuyt, W., Van Reeth, W. Biodiversity and ecosystem
services: Complementary approaches for ecosystem management?. In: Ecological Indicators, 21,
pp. 123-133, 2012. doi: 10.1016/j.ecolind.2011.06.021
Halkos, G.E. and Tzeremes, N.G., Measuring biodiversity performance: A conditional efficiency
measurement approach. In: Environmental Modelling & Software. 25, 12 (2010), 1866-1873.
Enkea, N. et al. 2012. The user’s view on biodiversity data sharing — Investigating facts of
acceptance and requirements to realize a sustainable use of research data. In: Ecological
Informatics. 11, (2012), 25-33.
Flemons, P., Guralnick, R., Krieger, J., Ranipeta, A., Neufeld, D. A web-based GIS tool for
exploring the world's biodiversity: The Global Biodiversity Information Facility Mapping and
Analysis Portal Application (GBIF-MAPA). In: Ecological Informatics, 2, 1 (2007), pp. 49-60.
doi: 10.1016/j.ecoinf.2007.03.004.
GBIF. Global Biodiversity Information Network. Available at: .
IABIN. Inter-American Biodiversity Information Network. Available at:
.
Agrawal, R.C. et al., An overview of biodiversity informatics with special reference to plant
genetic resources. In: Computers and Electronics in Agriculture. 84, 92-99, (2012).
DwC. DarwinCore Schema. Available at: .
DiGIR. Distributed Generic Information Retrieval. Available at: .
TAPIR. Tapir TDWG Task Group. Available at: .
Bafnaa, S. et al., Schema driven assignment and implementation of life science identifiers
(LSIDs). In: Journal of Biomedical Informatics. 41, 5 (2008), 730-738.
Salvanha, P., Najm, L. H., Corrêa, P. L. P. and Saraiva, A. M., Model of management and sharing
distributed interaction pollinators information for centralized biodiversity portals, In: Proc. 5th
Contecsi International Conference on Information Systems and Technology Management. São
Paulo, Brazil, 2009.
Schnasea, J.L. et al., Information technology challenges of biodiversity and ecosystems
informatics. In: Information Systems. 28, 4 (2003), 339-345.
Ozsu, T. M., Valduriez, P., Principles of Distributed Database Systems, 2 [S.I.]: Prentice Hall,
Batini, C., Lenzerini, M. and Navathe, S. B., A comparative analysis of methodologies for
database schema integration, In: ACM Comput. Surv., New York, USA, vol. 18, (1986), pp. 323-
Côrrea, P. L. P., Guidelines and procedures for the project database. (Thesis) Department of
Computer Engineering and Digital Systems of the Polytechnic University of São Paulo, 2002 [in
Portuguese].
Nicola, M. and Jarke, M., Performance Modeling of Distributed and Replicated Databases. In:
IEEE Transactions on Knowledge and Data Engineering. 12, 4 (2000), 645-672.
Osman, R. and J., W.K., Database system performance evaluation models: A survey. In:
Performance Evaluation. 69, (2012), 471-493.
Calero, C., Ruiz, J., Piattini, M., Classifying web metrics using the web quality model. In: Online
Information Review, vol. 29, 3(2005), pp. 227-248.
Olsina, L. et al., Using web quality models and a strategy for purpose-oriented evaluations. In:
Journal of Web Engineering. 10, 4 (2011), 316-352.