Preprints (not yet peer-reviewed)

153 downloads
1,084 views

Today there are no grammar systems that allow the creation of a fundamentally new word or concept. All existing grammar systems work only by referring to previously chosen terms, on the basis of which all definitions are created. Implementation of operation...

["Artificial Intelligence","Computational Linguistics","Natural Language and Speech"]
doi:10.7287/peerj.preprints.1576v3
135 downloads
561 views

Identifying and monitoring the impact of severe weather through social media data is a worthwhile challenge for data science. In recent years we have witnessed an increase in natural disasters, partly due to climate change. Many studies have shown that during such events people tend...

["Data Science","Emerging Technologies","Natural Language and Speech","Network Science and Online Social Networks"]
doi:10.7287/peerj.preprints.2241v2
43 downloads
244 views

Scaling up the analysis of sensitive or confidential documents frequently stumbles on the limited number of individuals with the necessary clearance to access the documents. The availability of cryptographic protocols compatible with text processing methods can...

["Cryptography","Data Science","Natural Language and Speech"]
doi:10.7287/peerj.preprints.2994v1
270 downloads
617 views

Building an effective team of developers is a complex task faced by both software companies and open source communities. The problem of forming a “dream” team involves many variables, including consideration of human factors, and it is not a dilemma solvable in...

["Data Mining and Machine Learning","Data Science","Natural Language and Speech","Social Computing","Software Engineering"]
doi:10.7287/peerj.preprints.2285v1
109 downloads
333 views

Kamus Dewan is the authoritative dictionary for Bahasa Malaysia, containing a wealth of linguistic and cultural information about Bahasa Malaysia. It is currently available in print, as well as a searchable online dictionary. However, the online dictionary lacks...

["Computational Linguistics","Natural Language and Speech"]
doi:10.7287/peerj.preprints.2205v1
314 downloads
797 views

Due to the rapid development of information technology, the Internet has gradually become part of everyday life. People like to communicate with friends and share their opinions on social networks. The diverse social network behavior is an ideal users' personality...

["Artificial Intelligence","Natural Language and Speech","Social Computing"]
doi:10.7287/peerj.preprints.1906v1
153 downloads
665 views

Developers summarize their changes to code in commit messages. When a message seems “unusual,” however, this casts doubt on the quality of the code contained in the commit. We trained n-gram language models and used cross-entropy as an indicator of commit...

["Data Mining and Machine Learning","Natural Language and Speech","Software Engineering"]
doi:10.7287/peerj.preprints.1771v1
641 downloads
1,089 views

Automated test generation tools have been widely investigated with the goal of reducing the cost of testing activities. However, generated tests have been shown not to help developers in detecting and finding more bugs even though they reach higher structural coverage...

["Natural Language and Speech","Software Engineering"]
doi:10.7287/peerj.preprints.1467v3
124 downloads
234 views

We present an application of the naturalness of software to provide multi-token code suggestions in GitHub’s Atom text editor. We extended the results of a simple n-gram prediction model using the "mean surprise" metric—the arithmetic mean of the surprisal of several...

["Data Mining and Machine Learning","Natural Language and Speech","Software Engineering"]
doi:10.7287/peerj.preprints.1597v1
137 downloads
392 views

Designing an effective methodology to summarize and analyze the amount of textual information produced by developers remains particularly challenging, especially when the goal is to help developers make better development/maintenance decisions....

["Natural Language and Speech","Software Engineering"]
doi:10.7287/peerj.preprints.1534v1
207 downloads
402 views

Product Data Management (PDM) produced desktop and web based systems to maintain the organizational technical and managerial data to increase the quality of products by improving the processes of development, business process flows, change management, product structure...

["Human-Computer Interaction","Artificial Intelligence","Databases","Natural Language and Speech","Software Engineering"]
doi:10.7287/peerj.preprints.1518v1
287 downloads
419 views

In this paper, we attempt to use statistical machine translation to convert Python 2 code to Python 3 code. We use data from two projects and achieve a high BLEU score. We also investigate cross-project training and testing to analyze the errors...

["Natural Language and Speech","Software Engineering"]
doi:10.7287/peerj.preprints.1459v1
228 downloads
333 views

We describe and experimentally validate a question-asking framework for machine-learned linguistic knowledge about human emotions. Using the Socratic method as a theoretical inspiration, we develop an experimental method and computational model for computers to...

["Agents and Multi-Agent Systems","Artificial Intelligence","Computational Linguistics","Data Mining and Machine Learning","Natural Language and Speech"]
doi:10.7287/peerj.preprints.1292v1
312 downloads
249 views

Despite being a relatively new discipline, Chinese Interpreting Studies (CIS) has witnessed tremendous growth in the number of publications and diversity of topics investigated over the past two decades. The number of doctoral dissertations produced has also increased...

["Data Mining and Machine Learning","Data Science","Databases","Digital Libraries","Natural Language and Speech"]
doi:10.7287/peerj.preprints.1277v1
416 downloads
353 views

Natural Language Processing (NLP) refers to human-like language processing, which makes it a discipline within the field of Artificial Intelligence (AI). However, the ultimate goal of research on Natural Language Processing is to parse and understand language,...

["Computational Linguistics","Natural Language and Speech"]
doi:10.7287/peerj.preprints.1225v1
What is a PeerJ Preprint?

A PeerJ Preprint is a draft of an article, abstract, or poster that has not yet been peer-reviewed for formal publication. Submit a draft, incomplete, or final version of your work for free.
