CAP Theorem and Big Data


The following article analyses the applicability of the CAP theorem to Big Data. I will explain the CAP theorem, explore the three of its characteristics, as well as provide the proof of the CAP theorem on an example that is closely related to Big Data use case. I will also briefly discuss couple of possible ways deal with the CAP-related issues in distributed Big Data applications and offer overview of those implementations that best fit each of the CAP properties. [Read more…]

Introduction to NoSQL & Document Data Store

The following article provides a high-level overview of NoSQL databases and the various associated data store types related to these kinds of databases. A particular section of the article is dedicated to a brief summary of the Document Oriented NoSQL databases. I provide example data that illustrate how Document NoSQL database store the data and also outline the most significant differences between the relational type of SQL database and document-oriented NoSQL. [Read more…]

MongoDB and BSON format

Recently I came across a statement that said: “MongoDB (btw. that’s MongoDB) uses the BSON format which extends the JSON model to provide additional data types” and I think this topic deserves a bit of clarification. [Read more…]

Microtargeting, Big Data, and Elections

Microtargeting (also micro-targeting or micro-niche targeting) is one of the methods that is used by the marketing sector to analyze consumer data collected from various sources to detect interests of specific individuals. This collection of data is ordered and classified and later provides the information that is used to influence the thoughts of specific like-minded groups of people. That said, one of the major aims of microtargeting initiatives is to simply identify their target audience to as granular level as possible and also identify target’s preferred communication channel. [Read more…]

Big Data & Deforestation – Use Case


The following use case is my attempt at denoting the importance of Big Data in reference to the world’s largest food companies and their current impact on the overall trend of deforestation in the world. The use case encompasses many of the V’s of Big Data and demonstrates that Big Data are increasingly important to consider, especially in connection to world’s largest food manufacturers and their analysis of the current and future deforestation trends. [Read more…]

Impact of Big Data Volume in the Context of Distributed Data, Scalability, Data Access and Storage


Volume is the most characteristic property of Big Data, which to a large extent affects the other five V’s of Big Data, namely the Velocity, Variety, Veracity, Variability, and Value. In this article, I explore the Volume’s impact primarily in the context of distributed data and scalability, data access and storage, as well as its impact on data transfer.

[Read more…]

‘Volume’ characteristics of Big Data

In the following article, I’ll analyze Big Data from a perspective of high-velocity capture, storing, processing and visualizing the large volumes of data. I’ll provide the examples of applications that create or collect the massive amounts of data, briefly explain the process; and expand on some of the drawbacks that engineers and developers face in the process of analysis and visualization of the Big Data volumes. Also, if you like the below article, don’t forget to check my post on The Significance of Big Data Lifecycle Management (BDLM). [Read more…]

How to Convert a Remote Physical Linux Server to a Virtual Machine (VMware / VirtualBox Instructions)

Recently I needed to convert my physical CentOS Linux 7.3 server to a virtual image that I can quickly run from either VMware Workstation Player or  VirtualBox (or keep as a backup that can be easily switched on as a copy of the physical server). I’ve kept searching the Internet, but only to find that the instructions were either too old or do not provide a proper step by step guide on how to go about this process. Thus the reason for this article, hopefully, it’ll help someone. [Read more…]