Journal of Web Engineering https://journals.riverpublishers.com/index.php/JWE <div class="JL3"> <div class="journalboxline"> <h2>Journal of Web Engineering</h2> </div> <div class="journalboxline">&nbsp;</div> <div class="journalboxline">Web Engineering is the scientific discipline that studies the theory and practice of constructing Web-based systems and applications. This includes theoretical principles and systematic, disciplined and quantifiable approaches towards the cost-effective development and evolution of high-quality, ubiquitously usable Web-based systems and applications. It fundamentally concerns the technology which enables the construction of Web applications. Web Engineering, while rooted in Computer Science and Engineering, draws from a diverse range of other disciplines, such as information science, information systems, management and business, among others.</div> </div> <p>&nbsp;</p> en-US jwe@riverpublishers.com (JWE) biswas.kajal@riverpublishers.com (Kajal Biswas) Sat, 25 May 2024 12:56:59 +0200 OJS 3.3.0.7 http://blogs.law.harvard.edu/tech/rss 60 Data Lake Conceptualized Web Platform for Food Research Data Collection https://journals.riverpublishers.com/index.php/JWE/article/view/24485 <p class="Abstracttext"><span lang="EN-GB">Food research is uniquely intertwined with everyday life and necessitates the utilization of big data. Within this domain, the research data consist of various forms and formats, encompassing biological experiment results, chemical analysis data, nutritional information, microbiological data, sensor data, images, and videos. This diversity stems from the integration of data from various subdomains within the larger field. With recent advancements in deep learning technology, the importance of data has grown significantly, resulting in increased reliance on data-driven research. Although specialized platforms for sharing and utilizing data have been established at the national level, particularly in the bioscience field, food research lacks a dedicated infrastructure and specialized data-sharing platforms. In this study, we develop a platform that leverages Hadoop-based distributed file systems to create a data lake. This platform enables data storage and sharing through a web-based interface. The distributed file system supports scalability by adding data nodes, making it an effective solution for capacity expansion. In addition, the web-based platform ensures high accessibility, allowing users access from anywhere, at any time, using any device. Finally, we introduce the establishment of a 1.8 PB Hadoop-based physical storage system and present an approach for building a highly accessible web platform with substantial utility.</span></p> Gi-taek An, Seyoung Oh, Eunhye Kim, Jung-min Park Copyright (c) 2024 Journal of Web Engineering https://journals.riverpublishers.com/index.php/JWE/article/view/24485 Sat, 25 May 2024 00:00:00 +0200 Application of an Improved Convolutional Neural Network Algorithm in Text Classification https://journals.riverpublishers.com/index.php/JWE/article/view/25003 <p>This paper proposes a text classification model based on a combination of a convolutional neural network (CNN) and a support vector machine (SVM) using Amazon review polarity, TREC, and Kaggle as experimental data. By adding an attention mechanism to simplify the parameters and using the classifier based on SVM to replace the Softmax layer, the extraction effect of feature words is improved and the problem of weak generalization ability of the CNN model is solved. Simulation experiments show that the proposed algorithm performs better in precision rate, recall rate, F1 value, and training time compared with CNN, RNN, BERT and term frequency-inverse document frequency (TF-IDF).</p> Jing Peng, Shuquan Huo Copyright (c) 2024 Journal of Web Engineering https://journals.riverpublishers.com/index.php/JWE/article/view/25003 Sat, 25 May 2024 00:00:00 +0200 Semantically Enriched Keyword Prefetching Based on Usage and Domain Knowledge https://journals.riverpublishers.com/index.php/JWE/article/view/18319 <p>In intelligent web systems [<a href="file:///F:/KAJAL%20DA/Article/JWE/JWE_23-3/JWE_23-3-Article-2/art2.html#bib2">2</a>], web prefetching [<a href="file:///F:/KAJAL%20DA/Article/JWE/JWE_23-3/JWE_23-3-Article-2/art2.html#bib27">27</a>] plays a crucial role. In order to make accurate predictions for web prefetching, it is important but challenging to uncover valuable information from web use statistics [<a href="file:///F:/KAJAL%20DA/Article/JWE/JWE_23-3/JWE_23-3-Article-2/art2.html#bib16">16</a>]. Using statistics and domain expertise, this study presents a new approach dubbed SPUDK for efficient prefetching. In this paper, it is shown how web access logs can be used efficiently for browsing prediction. Our main focus is on the technique needed to manage the queries found in web access logs so that valuable information can be attained. We further process these access logs using a taxonomy and a thesaurus, WordNet, to find the semantics of queries. SPUDK, a system that organises use data into semantic clusters, is one example of this approach. Our contributions in this paper are as follows: (1) A technique to exploit query keywords from access logs. (2) An approach to enrich queries with semantic information. (3) A new similarity measure for finding similarity among URLs present in access logs. (4) A novel clustering technique to find semantic clusters of URLs. (5) Experimental evaluation of the proposed system. The proposed SPUDK system is evaluated using American Online (AOL) logs, which gives improvement of 39% in precision of prediction, 35% in hit ratio and reduction of 50.6% in latency on average as compared to other prediction techniques in the literature.</p> Sonia Setia, Jyoti, Neelam Duhan, Aman Anand, Nikita Verma Copyright (c) 2024 Journal of Web Engineering https://journals.riverpublishers.com/index.php/JWE/article/view/18319 Sat, 25 May 2024 00:00:00 +0200 SPARQL Optimization Using Re-ordering Joining Patterns with Surrogate Key Concept and Subset Patterns https://journals.riverpublishers.com/index.php/JWE/article/view/23925 <p>Semantic web data resides on the web in the form of knowledge graphs known as RDF graphs and searching around the web has been always a crucial task. For the data retrieval of RDF data of the semantic web, SPARQL query language has been used which in turn is based on triple patterns and joins. Optimization of SPARQL query has been a problematic concern for decades due to the large amount of triple patterns associated with RDF data. Although several researchers have put a lot of effort into the optimization of SPARQL query, it is difficult to understand the concept from scratch due to its diversified nature. This paper analyses various optimization techniques for the SPARQL query used with the semantic web to process knowledge graphs. These techniques include join-based, heuristic-based, rule-based, and indexing-based approaches for optimization. This paper will help researchers in this domain to easily get into the core concept of SPARQL execution along with various optimization approaches used for query processing, which can help in various other domains like linked open data and information retrieval. In this paper, an optimization algorithm HSOA (hybrid SPARQL optimization algorithm) has been proposed, which comprises the features of index-based, cost-based, and triple reordering-based optimization approaches. The proposed hybrid algorithm has been designed specifically for n-triple RDF data, which comprises subset patterns, and surrogate key concepts. The results produced by the proposed algorithm are encouraging and have also been tested and compared with the benchmark dataset and SPARQL queries like LUBM, BSBM, and SP2Bench.</p> Rupal Gupta, Sanjay Kumar Malik Copyright (c) 2024 Journal of Web Engineering https://journals.riverpublishers.com/index.php/JWE/article/view/23925 Sat, 25 May 2024 00:00:00 +0200 Enhancing Suggestion Detection in Online User Reviews through Integrated Information Retrieval and Deep Learning Approaches https://journals.riverpublishers.com/index.php/JWE/article/view/23991 <p>In the aftermath of the COVID-19 pandemic, using web platforms as a communication medium and decision-making tool in online commerce has become widely acknowledged. User-generated comments, reflecting positive and negative sentiments towards specific items, serve as invaluable indicators, offering recommendations for product and organizational improvements. Consequently, the extraction of suggestions from mined opinions can enhance the efficacy of companies and organizations in this domain. Prevailing research in suggestion mining predominantly employs rule-based methodologies and statistical classifiers, relying on manually identified features. However, a recent trend has emerged wherein researchers explore solutions grounded in deep learning tools and techniques. This study aims to employ information retrieval techniques for the automated identification of suggestions. To this end, various methodologies, including distance measurement approaches, multilayer perceptron neural networks, support vector machines, regression logistics, convolutional neural networks utilizing TF-IDF, Bag of Words (BOW), and Word2Vec vectors, along with keyword extraction, have been integrated. The proposed approach is assessed using the SemEval2019 dataset to extract suggestions from the textual content of online user reviews. The obtained results demonstrate a notable enhancement in the F<sub>1</sub> score, reaching 0.76 compared to prior research. The experiments further suggest that information retrieval-based approaches exhibit promising potential for this specific task.</p> Zahra Hadizadeh, Amin Nazari , Muharram Mansoorizadeh Copyright (c) 2024 Journal of Web Engineering https://journals.riverpublishers.com/index.php/JWE/article/view/23991 Sat, 25 May 2024 00:00:00 +0200