Until as recently as 5 years ago, text data was considered the little brother of structured data. Consumer insights were mainly based on market research surveys, which often contained several closed-ended questions and one open-ended question. Responses to open questions were coded manually, and insights were mainly derived from structured questions, using the coded open answers to (hopefully) confirm those insights. However, times are changing.
Data is changing: the digital revolution has led to a vast increase in the amount of text data in existence. Words are all around us. Social media, customer service centers, web forms, forums, emails, surveys, the list goes on: so much talk is generated about your brands, products or services across a wide range of channels. The amount of text available often excludes manual coding as an option for analysis, as it is too expensive and time-consuming – and not even very accurate.
At Boobook, clients often ask us how to reveal the wisdom behind their text data. The easy answer to this question: text analytics. However, text analytics can take many forms, and choosing which approach to choose is not a walk in the park. Choosing the right approach depends on different specifications, such as the nature of the source, client needs, data volume, languages used and reporting frequency. An experienced analyst is indispensable in guiding you towards the ideal solution.
The easiest approach is based on word frequency. Visualizations in the form of word clouds offer initial insights into, for example, what people are saying about your brand. Even a ‘simple’ word cloud can become interesting if we link the words to structured data. You can ask these questions:
With text analytics, we can automatically code text into themes and even determine the sentiment of the text (positive, neutral or negative) without a Net-Promoter-Score or an extra question concerning satisfaction. Different levels of human involvement are possible to identify the themes:
With both approaches, we can apply the same coding rule when new data comes in. These methods minimize coder bias (especially topic modeling) and guarantee comparable coding between waves.
We can even take text data analytics a step further and predict specific outcomes. Some examples:
With natural language processing (NLP), grammar and sentence structure also comes into play. NLP allows for a deeper understanding of the language, where words are categorized as nouns, verbs, adjectives, and so on. This information can be used to support the analysis and to add an extra level of context. This technique is mainly applied when you need a responsive model that can interpret questions and give feedback. Nowadays, smart chatbots can recognize human language through natural language processing (NLP) and generate a response.
Text data analytics is a very large container that answers different needs, from the creation of a word cloud to the prediction of a specific future behavior. Each approach has its advantages and its pitfalls. Here at Boobook, we are happy to help you choose the best approach to deriving true wisdom from your text.