ARES Analysis

ARES Analysis is nonprofit NLP research project developing information extraction system that process, analyzes and aggregates news from BBC and Reuters on daily basis. It extracts triples in the form: subject(noun) - relation(verb) - object (noun). For example sentence "Trump meets Putin" is extracted as Trump(subject) -> meets (verb relation) -> Putin (object). The software is being written in Java.

Extraction is based on chunking (Shallow Parsing), so resulting model must be improved in the future with Deep Parsing approach. Shallow Parsing processes text as flat structure (not tree structure that is domain of Deep Parsing). Regarding extraction data, you can choose between chunks and superChunks. Chunks are less acurate then superChunks in regard to searching terms but contain more context data then superChunks.

ARES Analysis gives you opportunity to perform search on news data in different ways. For example you can search for every place,person,event etc. (object) that U.S. President Donald Trump visits/visited/will visit if you type subject:Trump verb relation: visit. Or you can search for all interactions/verb relations between EU and China if you type subject: EU object:China. If you want to perform just simple full text search on all news sentences in database, just type something into "Sentence" field.

If you want to use logical operators in search boxes (subject, verb relation, object, sentence) you can use AND OR operators. For example: Trump OR Putin visit OR meet Merkel AND Macron. Each field can contain either AND or OR operator but not both.

ARES Analysis uses combination of custom ARES algorithm and Recursive Neural Network for sentiment analysis of sentences. It means you can search for example for all data that contains verb relation "invest" and have positive sentiment. Please be aware that sentiment data are not 100% accurate because our sentiment analysis model is still in the process of training. Sentiment value of sentences represents rather sentiment context than direct sentiment value between two entities. This is especially valid for sentences that contain more than two entities/nouns. The other important fact about sentiment is that it is to the certain degree subjective value. What is positive for one entity can be negative for other entity. The subjectivity in this system is based on the European "point of view". For example sentence "Russia sends special forces into Syria to help stabilize Assad's regime" is obviously positive for Russia and Syria but negative in this system as widening of Russian power in the Middle East is not in European interest. The simplest sentences for sentiment analysis are sentences about economic, diplomatic, political, cultural and other forms of cooperation. They are evaluated as positive. The exception are sentences about cooperation between/among "rogue actors" (like ISIS, Al Qaeda, Syria, Iran etc.)

Ares Frequency Relations Matrix: displays N-ary relations with frequencies among entities in adjacency matrix (graph theory)

Last extraction algorithm runs:

BBC --> processed sentences: 0, processed URLs: 139, finished time: 2021-12-13-08-51-09
Reuters --> processed sentences: 422, processed URLs: 59, finished time: 2021-12-31-08-18-43
Subject Verb relation Object Sentence
Select sentiment Select table Date (MM-DD)