According IDC, 80% of all data produced are NoSQL. See:
.png)
There are digital documents, scanned documents, online and offline texts, blob content into SQL, images, videos and audio. Imagine a Corporate Analytics initiative without all these data to analyze and support decisions?
In all the world, many projects are using techonologies to transform these NoSQL data into textual content, to allows analyze it. See:
- Scanned images and images with text extracted using OCR (Google Tesseract is a great option);
- Videos analyzed with Visual Computing supported by Machine Learning (OpenCV is a good option)



.png)

