Towards smarter data - accuracy and precision

There is a huge amount of information out there. And it is growing. To make it efficient and increase our competitive advantage we need to evolve and start using information in a smart way, by concentrating on data that drives business value because it is accurate, actionable, and agile. Accuracy is an important measure that determines the quality of data processing solutions.

How accuracy is calculated?

It is easy to do with structured data, because the requirements are formalizable. It is less obvious with unstructured data, e.g. a stream of social feeds, or any data set that involves natural language. Indeed, the sentences of natural language are subject to multiple interpretations, and therefore allow a degree of subjectivity. For example, should a sentence ‘I haven’t been on a sea cruise for a long time’ be qualified for a data set of people interested in going on a cruise? Both answers, yes and no, seem valid.

In these cases an argument was put forward endorsing a consensus approach which polls data providers is the best way to judge data accuracy. This approach essentially claims that attributes with the highest consensus across data providers is the most accurate.

At nmodes we deal with unstructured data all the time because we process natural language messages, primarily from social networks. We do not favor this simplistic approach, as it is considered biased, inviting people to make assumptions based on what they already believe to be true, and making no distinction between precision and accuracy. Obviously the difference is that precision measures what you got right, and accuracy measures both what you got right and what you got wrong. Accuracy is a more inclusive and therefore more valuable characteristic.

Our approach is

a) to validate data against third party independent sources (typically of academic origin) that contain trusted sets and reliable demography. Validating nmodes data against third party sources allows us to verify that our data achieves the greatest possible balance of scale and accuracy.

b) to enrich upon the existing test sets by purposefully including examples ambiguous in meaning and intent, and providing additional levels of categorization to cover these examples.

Accuracy is becoming important when businesses move from rudimentary data use, typical of the first Big Data years, to a more measured and careful approach of today. Understanding how it is calculated and the value it brings helps in achieving long-term sustainability and success.

Digital industry is changing rapidly.

For the last decade analysis of social chatter and capture of consumer sentiment was considered the cutting edge of the marketing strategy. In these early days of the new era of digital information businesses were told to listen to what market is saying about them. They were educated on the importance of media monitoring and the advantages it creates for strategic growth.

This picture has become outdated.

Listening to Big Data, in all its aspects and forms, is no longer enough. After you successfully listened and understood what customer said the next natural step would be to act, or respond. And so the digital domain is now spreading to include responses, with a host of innovative technological solutions reshaping the field rapidly. Advances in artificial intelligence in particular create disruptive scalable opportunities in the space traditionally known for its slow manual progression.

Facebook was among the firstto enter the market, introducing bots into the process of connecting users with brands. Then there was Microsoft's turn.

Following these developments bots became the hottest trend in Silicon Valley in 2016.

nmodes fits seamlessly into this new world order. We deliver AI solutions that power business sales process. Our listening solution accurately monitors and captures real-time needs and interests of individual customers within the defined audience. And our Intelligent Assistant solution brings scalability to responses without compromising on quality.

Towards smarter data - accuracy and precision

Artificial Intelligence Life Chat

The End of Digital Monitoring Paradigm