Tag: Apache Giraph
-
Data Modeling in the Big Data Era: HDFS (Part 2)
From discussions that spun off from the last article on “Data Modeling in the Big Data Era,” it became apparent that a discussion of the Hadoop Distributed File System (HDFS) was warranted as this is basically the physical implementation of any Hive, or Impala model, and design considerations here also impact a few security concerns.…
-
Graphs, what are they, and can they help us associate Words with Data?
A network is a collection of things, like computers, that interact with one another. This can be a physical network with routers, switches, and hubs, as well as the World Wide Web where documents are linked to one another. While the documents themselves might be interesting, a vast amount of information can be extracted by…