Preprints (not yet peer-reviewed)

108 downloads
872 views

Today there are no grammar systems, which allow the creation of a fundamentally new word and concept. All existing grammar systems only work by referring to previously chosen terms, on the bases of which all definitions are created. Implementation of operation...

["Artificial Intelligence","Computational Linguistics","Natural Language and Speech"]
doi:10.7287/peerj.preprints.1576v2
187 downloads
408 views

Building an effective team of developers is a complex task faced by both software companies and open source communities. The problem of forming a “dream” team involves many variables, including consideration of human factors, and it is not a dilemma solvable in...

["Data Mining and Machine Learning","Data Science","Natural Language and Speech","Social Computing","Software Engineering"]
doi:10.7287/peerj.preprints.2285v1
87 downloads
297 views

Severe weather impact identification and monitoring through social media data is a good challenge for data science. In last years we assisted to an increase of natural disasters, also due to climate change. Many works showed that during such events people tend...

["Data Science","Emerging Technologies","Natural Language and Speech","Network Science and Online Social Networks"]
doi:10.7287/peerj.preprints.2241v1
78 downloads
221 views

Kamus Dewan is the authoritative dictionary for Bahasa Malaysia, containing a wealth of linguistic and cultural information about Bahasa Malaysia. It is currently available in print, as well as a searchable online dictionary. However, the online dictionary lacks...

["Computational Linguistics","Natural Language and Speech"]
doi:10.7287/peerj.preprints.2205v1
243 downloads
590 views

Due to the rapid development of information technology, Internet has become part of everyday life gradually. People would like to communicate with friends to share their opinions on social networks. The diverse social network behavior is an ideal users' personality...

["Artificial Intelligence","Natural Language and Speech","Social Computing"]
doi:10.7287/peerj.preprints.1906v1
123 downloads
560 views

Developers summarize their changes to code in commit messages. When a message seems “unusual,” however, this puts doubt into the quality of the code contained in the commit. We trained \(n\)-gram language models and used cross-entropy as an indicator of commit...

["Data Mining and Machine Learning","Natural Language and Speech","Software Engineering"]
doi:10.7287/peerj.preprints.1771v1
561 downloads
1,000 views

Automated test generation tools have been widely investigated with the goal of reducing the cost of testing activities. However, generated tests have been shown not to help developers in detecting and finding more bugs even though they reach higher structural coverage...

["Natural Language and Speech","Software Engineering"]
doi:10.7287/peerj.preprints.1467v3
91 downloads
184 views

We present an application of the naturalness of software to provide multi-token code suggestions in GitHub’s Atom text editor. We extended the results of a simple n-gram prediction model using the "mean surprise" metric—the arithmetic mean of the surprisal of several...

["Data Mining and Machine Learning","Natural Language and Speech","Software Engineering"]
doi:10.7287/peerj.preprints.1597v1
116 downloads
311 views

The problem of designing effective methodology to summarize, and analyze the amount of textual information produced by developers remains particularly challenging especially when the goal is to help developers in making better development/maintenance decisions....

["Natural Language and Speech","Software Engineering"]
doi:10.7287/peerj.preprints.1534v1
178 downloads
331 views

Product Data Management (PDM) produced desktop and web based systems to maintain the organizational technical and managerial data to increase the quality of products by improving the processes of development, business process flows, change management, product structure...

["Human-Computer Interaction","Artificial Intelligence","Databases","Natural Language and Speech","Software Engineering"]
doi:10.7287/peerj.preprints.1518v1
199 downloads
354 views

In this paper, we have tried to use statistical machine translation in order to convert Python 2 code to Python 3 code. We use data from two projects and achieve a high BLEU score. We also investigate the cross-project training and testing to analyze the errors...

["Natural Language and Speech","Software Engineering"]
doi:10.7287/peerj.preprints.1459v1
156 downloads
284 views

We describe and experimentally validate a question-asking framework for machine-learned linguistic knowledge about human emotions. Using the Socratic method as a theoretical inspiration, we develop an experimental method and computational model for computers to...

["Agents and Multi-Agent Systems","Artificial Intelligence","Computational Linguistics","Data Mining and Machine Learning","Natural Language and Speech"]
doi:10.7287/peerj.preprints.1292v1
229 downloads
209 views

Despite being a relatively new discipline, Chinese Interpreting Studies (CIS) has witnessed tremendous growth in the number of publications and diversity of topics investigated over the past two decades. The number of doctoral dissertations produced has also increased...

["Data Mining and Machine Learning","Data Science","Databases","Digital Libraries","Natural Language and Speech"]
doi:10.7287/peerj.preprints.1277v1
312 downloads
298 views

Natural Language Processing (NLP) refers to Human-like language processing which reveals that it is a discipline within the field of Artificial Intelligence (AI). However, the ultimate goal of research on Natural Language Processing is to parse and understand language,...

["Computational Linguistics","Natural Language and Speech"]
doi:10.7287/peerj.preprints.1225v1
135 downloads
272 views

We investigated the interplay between overt prosodic cues and semantic cues on the structural interpretation of spoken sentences that permit either high- or low-attachment of a final relative clause. Prosodic cues were manipulated via the presence or absence of...

["Computational Linguistics","Natural Language and Speech"]
doi:10.7287/peerj.preprints.1210v1
What is a PeerJ Preprint?

A PeerJ Preprint is a draft of an article, abstract, or poster that has not yet been peer-reviewed for formal publication. Submit a draft, incomplete, or final version of your work for free.

Submissions today can be approved by Editorial Staff and online in 24 hours.

Establish precedent. Solicit feedback. Publish updates.

Top subject areas - Preprints

Top subject areas - People

View all subject areas