International Journal of Data Science and Analytics - Call for Papers: Special Issue on Large-scale Statistical Natural Language Processing

Guest Editors

Prof. Dr. Sebastião Pais, University of Beira Interior, Covilhã, Portugal – sebastiao@di.ubi.pt (this opens in a new tab)

Prof. Dr. Gaël Dias, University of Caen Normandie, Caen, France gael.dias@unicaen.fr

Prof. Dr. João Cordeiro, University of Beira Interior, Covilhã, Portugal - jpaulo@di.ubi.pt (this opens in a new tab)

Prof. Dr Mohammed Hasanuzzaman, Cork Institute of Technology, Cork, Ireland - mohammed.hasanuzzaman@cit.ie (this opens in a new tab)

Aims and Scope

Data Scientists work with tons of data, and many times that data includes natural language text. Modern organizations work with vast amounts of data. That data can come in various forms, including documents, spreadsheets, audio recordings, emails, JSON, and so many, many more. One of the most common ways that such data is recorded is via text. That text is usually quite similar to the natural language that we use from day-to-day. Natural Language Processing (NLP) is studying programming computers to process and analyze large amounts of natural textual data. Knowledge of NLP is essential for Data Scientists since it is easy to use and standard containers for storing data. Faced with performing analysis and building models from textual data, one must know how to perform the basic Data Science tasks. That includes cleaning, formatting, parsing, analyzing, visualizing, and modelling the text data. It will all require a few extra steps and the usual way these tasks are done when the data is made up of raw numbers. This kind of data needs NLP, a form of the machine learning algorithm, to analyze its contents. NLP is seen as the next big thing in data analytics that can harness big data to derive information using innovative methods to produce useful insights on current or projected market trends. NLP studies the patterns emerging in the text entries in the big data by analyzing the linguistics and semantics through statistics and machine learning and extracts the significant entities and relationships in what the customers are trying to say posts.

Essentially, instead of focusing on a word or a string of words, NLP comprehensively analyses sentences for their intent. The most common NLP methodologies are automatic summarization, disambiguation, part-of-speech tagging, relations extraction, entity extraction, and, most importantly, natural language understanding and recognition. Although research on NLP is being conducted since quite a few decades, the field has shown significant progress only in the last years. Machine learning methodologies that use NLP are now being deployed extensively across enterprises through their partner big data consulting company. Statistical Natural Language Processing (SNLP) is a field lying in the intersection of natural language processing and machine learning. SNLP differs from traditional natural language processing in that instead of having a linguist manually construct some model of a given linguistic phenomenon, that model is instead (semi-) automatically constructed from linguistically annotated resources, unsupervised and language independent. Methods for assigning part-of-speech tags to words, categories to texts, parse trees to sentences, and so on, are (semi-) automatically acquired using machine learning techniques.

In this special issue, we invite researchers and practitioners, both from academia and industry, from different disciplines and fields such as machine learning, deep learning, natural language processing, statistical learning, data mining, computational linguistics, social network analysis and other related areas to submit novel and significantly extended qualitative and quantitative research papers, that Focus on knowledge discovery in large amounts of data in data science.

Topics of Interest

Statistical Inference
Statistical Parsing
Statistical learning
Text Classification
Question Answering
Sentiment Analysis
Summarization
Conversational agents
Narrative Science
Lexical semantics
Word sense disambiguation
Speech recognition
Text-to-speech and spoken language understanding
Computational Social Science and Social Media
Dialogue and Interactive Systems
Discourse and Pragmatics
Information Extraction
Information Retrieval and Text Mining
Linguistic Theories
Cognitive Modeling and Psycholinguistics
Machine Learning for NLP
Machine Translation
Deep Learning for NLP
NLP Applications in Big Data

Important Dates

Submission system opening: July 1st, 2021
Submission system closing: December 30th, 2021
First review notification: February 30th, 2022
Resubmission of revised manuscript: March 30th, 2022
Final notification due: April 30th, 2022
Camera-ready deadline: May 5th, 2022

International Journal of Data Science and Analytics - Call for Papers: Special Issue on Large-scale Statistical Natural Language Processing

Search

Navigation