Jun

Abundance of Information Often is a Liability

A massive change has occurred in the world during the last ten to twenty years. Until recently and throughout the history of mankind information was hard to access. Obtaining and sharing information was either a laborious process or impossible, and the underlying assumption was that information can never be enough.

Today, of course, we have the opposite picture. Not only information is easily available, it keeps pouring in from a growing number of sources, and we continuously find ourselves in situations when there is more information than we want or able to process.

A major task we, as species, are facing is therefore how to reduce or filter out relevant information. It is, to repeat, in direct opposition to the task we’ve been accustomed to during all previous centuries, which was how to obtain information.

Since this change took place only recently, within a lifetime of one generation, we didn’t have time to develop efficient set of procedures to address the new problem. But the work has started and will only accelerate with time.

Interested in reading more? Check out our other blogs:

Amazing Social Data for Travel Companies

                                                   

A huge number of travel related conversations is happening every day on social networks.

Based on nmodes Twitter data (averaged over 1.5 years of observations) there is

- 1 conversation every 15 minutes in which people notify that they are going to NYC;

- 1 conversation every 43 minutes in which people from the USA express intent to go to Europe;

- 1 conversation every 4 minutes with interest or intent to go on vacation;

- 1 conversation every 3 hours in which people are asking for hotel recommendations.

And this is just a tip of the iceberg.

(nmodes currently has 70+ travel-related topics and intents, and growing.)

For travel companies all these are qualified leads, potential customers, and attentive audience.

Reaching out to these potential customers results in a positive consumer experience, brand recognition, and, yes, sales!

READ MORE

Beware the lure of crowdsourced data

Crowdsourced data can often be inconsistent, messy or downright wrong 

We all like something for nothing, that’s why open source software is so popular. (It’s also why the Pirate  Bay exists). But sometimes things that seem too good to be true are just that. 

Repustate is in the text analytics game which means we needs lots and lots of data to model certain  characteristics of written text. We need common words, grammar constructs, human-annotated corpora  of text etc. to make our various language models work as quickly and as well as they do. 

We recently embarked on the next phase of our text analytics adventure: semantic analysis. Semantic  analysis the process of taking arbitrary text and assigning meaning to the individual, relevant components.  For example, being able to identify “apple” as a fruit in the sentence “I went apple picking yesterday” but to  identify “Apple’ the company when saying “I can’t wait for the new Apple product announcement” (note:  even though I used title case for the latter example, casing should not matter)

READ MORE