A MODEL FOR ANALYSING DATA PORTAL PERFORMANCE: THE BIODIVERSITY CASE

Authors

  • PEDRO LUIZ PIZZIGATTI CORRÊA University of São Paulo, São Paulo
  • PABLO SALVANHA University of São Paulo, São Paulo
  • ANTONIO MAURO SARAIVA University of São Paulo, São Paulo
  • PAULO SCARPELINI NETO São Paulo State University, São José do Rio Preto
  • CARLOS ROBERTO VALÊNCIO São Paulo State University, São José do Rio Preto
  • ROGÉRIA CRISTIANE GRATÃO DE SOUZA São Paulo State University, São José do Rio Preto

Keywords:

Data Portal Performance, Distributed Database Systems, Data Replication Processes

Abstract

Currently, many museums, botanic gardens and herbariums keep data of biological collections and using computational tools researchers digitalize and provide access to their data using data portals. The replication of databases in portals can be accomplished through the use of protocols and data schema. However, the implementation of this solution demands a large amount of time, concerning both the transfer of fragments of data and processing data within the portal. With the growth of data digitalization in institutions, this scenario tends to be increasingly exacerbated, making it hard to maintain the records updated on the portals. As an original contribution, this research proposes analysing the data replication process to evaluate the performance of portals. The Inter-American Biodiversity Information Network (IABIN) biodiversity data portal of pollinators was used as a study case, which supports both situations: conventional data replication of records of specimen occurrences and interactions between them. With the results of this research, it is possible to simulate a situation before its implementation, thus predicting the performance of replication operations. Additionally, these results may contribute to future improvements to this process, in order to decrease the time required to make the data available in portals.

 

Downloads

Download data is not yet available.

References

Shanping, L., Jiaqi, T.,A collaborative performance tuning approach for Portal-based web sites.

In: Sixth International Conference on Networked Computing and Advanced Information

Management (NCM), 2010, pp.113-117.

Nicola, M. and Jarke, M. 2000. Performance Modeling of Distributed and Replicated Databases.

In: IEEE Transactions on Knowledge and Data Engineering. 12, 4 (2000), 645-672.

Schneiders, A., Van Daele, T., Van Landuyt, W., Van Reeth, W. Biodiversity and ecosystem

services: Complementary approaches for ecosystem management?. In: Ecological Indicators, 21,

pp. 123-133, 2012. doi: 10.1016/j.ecolind.2011.06.021

Halkos, G.E. and Tzeremes, N.G., Measuring biodiversity performance: A conditional efficiency

measurement approach. In: Environmental Modelling & Software. 25, 12 (2010), 1866-1873.

Enkea, N. et al. 2012. The user’s view on biodiversity data sharing — Investigating facts of

acceptance and requirements to realize a sustainable use of research data. In: Ecological

Informatics. 11, (2012), 25-33.

Flemons, P., Guralnick, R., Krieger, J., Ranipeta, A., Neufeld, D. A web-based GIS tool for

exploring the world's biodiversity: The Global Biodiversity Information Facility Mapping and

Analysis Portal Application (GBIF-MAPA). In: Ecological Informatics, 2, 1 (2007), pp. 49-60.

doi: 10.1016/j.ecoinf.2007.03.004.

GBIF. Global Biodiversity Information Network. Available at: .

IABIN. Inter-American Biodiversity Information Network. Available at:

.

Agrawal, R.C. et al., An overview of biodiversity informatics with special reference to plant

genetic resources. In: Computers and Electronics in Agriculture. 84, 92-99, (2012).

DwC. DarwinCore Schema. Available at: .

DiGIR. Distributed Generic Information Retrieval. Available at: .

TAPIR. Tapir TDWG Task Group. Available at: .

Bafnaa, S. et al., Schema driven assignment and implementation of life science identifiers

(LSIDs). In: Journal of Biomedical Informatics. 41, 5 (2008), 730-738.

Salvanha, P., Najm, L. H., Corrêa, P. L. P. and Saraiva, A. M., Model of management and sharing

distributed interaction pollinators information for centralized biodiversity portals, In: Proc. 5th

Contecsi International Conference on Information Systems and Technology Management. São

Paulo, Brazil, 2009.

Schnasea, J.L. et al., Information technology challenges of biodiversity and ecosystems

informatics. In: Information Systems. 28, 4 (2003), 339-345.

Ozsu, T. M., Valduriez, P., Principles of Distributed Database Systems, 2 [S.I.]: Prentice Hall,

Batini, C., Lenzerini, M. and Navathe, S. B., A comparative analysis of methodologies for

database schema integration, In: ACM Comput. Surv., New York, USA, vol. 18, (1986), pp. 323-

Côrrea, P. L. P., Guidelines and procedures for the project database. (Thesis) Department of

Computer Engineering and Digital Systems of the Polytechnic University of São Paulo, 2002 [in

Portuguese].

Nicola, M. and Jarke, M., Performance Modeling of Distributed and Replicated Databases. In:

IEEE Transactions on Knowledge and Data Engineering. 12, 4 (2000), 645-672.

Osman, R. and J., W.K., Database system performance evaluation models: A survey. In:

Performance Evaluation. 69, (2012), 471-493.

Calero, C., Ruiz, J., Piattini, M., Classifying web metrics using the web quality model. In: Online

Information Review, vol. 29, 3(2005), pp. 227-248.

Olsina, L. et al., Using web quality models and a strategy for purpose-oriented evaluations. In:

Journal of Web Engineering. 10, 4 (2011), 316-352.

Downloads

Published

2013-01-28

How to Cite

CORRÊA, P. L. P. ., SALVANHA, P. ., SARAIVA, A. M. ., NETO, P. S. ., VALÊNCIO, C. R. ., & DE SOUZA, R. C. G. . (2013). A MODEL FOR ANALYSING DATA PORTAL PERFORMANCE: THE BIODIVERSITY CASE. Journal of Web Engineering, 12(3-4), 232–248. Retrieved from https://journals.riverpublishers.com/index.php/JWE/article/view/4157

Issue

Section

Articles