Nov

Towards smarter data - accuracy and precision

                                                   

There is a huge amount of information out there. And it is growing. To make it efficient and increase our competitive advantage we need to evolve and start using information in a smart way, by concentrating on data that drives business value because it is accurate, actionable, and agile. Accuracy is an important measure that determines the quality of data processing solutions.

How accuracy is calculated?

It is easy to do with structured data, because the requirements are formalizable. It is less obvious with unstructured data, e.g. a stream of social feeds, or any data set that involves natural language. Indeed, the sentences of natural language are subject to multiple interpretations, and therefore allow a degree of subjectivity. For example, should a sentence ‘I haven’t been on a sea cruise for a long time’ be qualified for a data set of people interested in going on a cruise? Both answers, yes and no, seem valid.

In these cases an argument was put forward endorsing a consensus approach which polls data providers is the best way to judge data accuracy. This approach essentially claims that attributes with the highest consensus across data providers is the most accurate.

At nmodes we deal with unstructured data all the time because we process natural language messages, primarily from social networks. We do not favor this simplistic approach, as it is considered biased, inviting people to make assumptions based on what they already believe to be true, and making no distinction between precision and accuracy. Obviously the difference is that precision measures what you got right, and accuracy measures both what you got right and what you got wrong. Accuracy is a more inclusive and therefore more valuable characteristic.

Our approach is

a) to validate data against third party independent sources (typically of academic origin) that contain trusted sets and reliable demography. Validating nmodes data against third party sources allows us to verify that our data achieves the greatest possible balance of scale and accuracy.

b) to enrich upon the existing test sets by purposefully including examples ambiguous in meaning and intent, and providing additional levels of categorization to cover these examples.

Accuracy is becoming important when businesses move from rudimentary data use, typical of the first Big Data years, to a more measured and careful approach of today. Understanding how it is calculated and the value it brings helps in achieving long-term sustainability and success.

 

Interested in reading more? Check out our other blogs:

Lessons for Businesses from Brazil’s World Cup Disaster

1. Mental, or psychological, state of your team is important: you can put so much pressure on people before they crack. Brazil players didn’t become unqualified professionals overnight. They failed because they were overwhelmed by their country’s expectations, distorted sense of history, and the right to win considered divine. They were too emotionally charged, not in the proper state of mind to compete. So better keep calm, relaxed atmosphere in your team even before launch, or important deadline.

2. Manage customer expectations. Brazil were ramping them up unreasonably. Aggressive messages like the 6th[title] is coming, statements by their coach about two more steps to heaven massively backfired by creating an unhealthy emotional frenzy in the society, which in return influenced the players (see 1.)

3. Logic, organization is the key to successful execution. Germany are not a great team. But they are very well organized. They had a detailed game-plan where every team member knew his task and several different scenarios where prepared. They were able to adjust when the situation on the field changed to squeeze maximum advantage. Sounds simple? That’s because it is. 

READ MORE

The Automation Is Coming

                                                         

A close look at the history of humanity leaves us with no choice but to admit that the majority of jobs as we know them now will be transferred to automated systems. This is part of the technological and scientific progress our civilization is undertaking and it is irreversible.

Artificial intelligence became mainstream in 2016. For the first time artificial intelligence is not only available to big companies like Google, Amazon or Apple, but to the majority of businesses worldwide.  Startups have started building products and services using artificial intelligence en masse.

The essence of artificial intelligence is massive, intuitive computing power: machines so smart that they can learn and become even smarter.  The machines are becoming quicker and more nimble. They cover wider range of conversation topics. They now connect to robotic systems and online interactive systems. There is literally very little they cannot, or will not be able to, do as applied to industrial workforce.

With all the good that’s going to come with automation, we are suddenly faced with a new problem: the elimination of many low and middle class jobs. Many jobs that have already been severely impacted by computers (manufacturing, administrative support, retail, and transportation) will continue to diminish. In the nearest future routine-based jobs (telemarketing, sewing) and work that can be solved by smart algorithms (tax preparation, data entry keyers and insurance underwriters) are most likely to be eliminated.

What to do? It is fruitless to fight automation, we need to find ways to work with automation rather than against it.

The solution is to become more creative as species. Creativity is the natural advantage of humans over machines. Automation is about to change the course of the world, it’s going to be a great disruptor and impact the workforce like nothing we’ve seen before. We can sit around and gradually become obsolete, or accept the challenge and use the tool of creativity, which we are in unique possession of, to maintain our superiority.

 

READ MORE