Tag: Big Data
-
Flafka: Big Data Solution for Data Silos
From the previous post on “Poor Data Management Practices“, Â the discussion ended with a high level approach to one possible solution for data silos. Traditional approaches for solving the data silo problem can cost millions of dollars (even for a moderately sized company), and typically requires a huge effort in integration work (e.g., data modeling,…
-
Poor Data Management Practices
Most people from large organizations can relate to the picture of grain silos representing isolated data stores, and my guess is that even small to medium sized company owners and employees can see their companies evolving to have these silos of data. When you see this, hopefully, it makes you ask yourself, “are we  being…
-
Natural Language Processing and Sentiment Analysis
Why is it there is no “Boring” button for posts on Facebook, Twitter, LinkedIn, and most other social media sites? It would make sentiment analysis so much easier. My opinion, it is to maintain some semblance of decorum and civility, as well as sparing people’s feelings. It’s bad enough when you get a 1,000 views,…
-
NLP – Resume Analysis in R
Natural Language Processing (NLP) is a field of study involving the interaction between computers and the human spoken and written languages, which implies both understanding and communicating. This can become very complicated as you might have already guessed, and many people have simplified it to the point of being about data driven word clouds, or…
-
Predictive Analytics Platform for Overcoming Memory Limitations
IF you’re interested, with emphasis on the IF, I provide an easy to follow step-by-step guide for building a predictive analytics, or data profiling platform utilizing Hadoop, and Spark to overcome some of the memory limitations that come with using a desktop version of R for extremely large data sets. Also, it provides notes that…