Tag Archives: nosql

Tenzing A SQL Implementation On The MapReduce Framework

Tenzing is a query engine built on top of MapReduce for ad hoc analysis of Google data. Tenzing supports a mostly complete SQL implementation (with several extensions) combined with several key characteristics such as heterogeneity, high performance, scalability, reliability, metadata awareness, low latency, support for columnar storage and structured data, and easy extensibility. Tenzing is currently used internally at Google by 1000+ employees and serves 10000+ queries per day over 1.5 petabytes of compressed data. In this paper, we describe the architecture and implementation of Tenzing, and present benchmarks of typical analytical queries.

(Full Story: Tenzing A SQL Implementation On The MapReduce Framework)

4 Months with Cassandra, a love story | Cloudkick

Advantages of Cassandra
> Linear scalability
> Low operational costs
> Hybrid NoSQL

Administration and operational issues
> nodetool, previously known as nodeprobe
> Major compactions
> Tombstones
> Client reconnection
> Thrift issues

(Full Story: 4 Months with Cassandra, a love story | Cloudkick)

Google BigQuery Service

Google BigQuery Service is a web service that enables you to do interactive analysis of massively large datasets—up to billions of rows. Scalable and easy to use, BigQuery lets developers and businesses tap into powerful data analytics on demand.

(Full Story: Google BigQuery Service)

Migration of Relational Data structure to Cassandra (No SQL) Data structure

Cassandra data structures can be accessed using different programing languages like (.net, Java, Python, Ruby etc.). Data can be directly loaded from the relational databases (like Access, SQL Server, Oracle, MySQL, IBM DB2, etc.) using these programing languages. Custom loaders could be used to load data into Cassandra data structure(s) based on the enactment rules, customization level and the kind of data processing.

(Full Story: Migration of Relational Data structure to Cassandra (No SQL) Data structure)

CouchBase, SQLite launch unified NoSQL query language

UnQL, pronounced “Uncle,” could be considered a “superset” of the SQL syntax, Phillips said. It can parse all statements formulated in the SQL language and supports a number of new operators and expressions as well.

(Full Story: CouchBase, SQLite launch unified NoSQL query language)

NoSQL is a Premature Optimization

Point 1:  NoSQL technologies require more investment than Relational to get going with. 
Point 2:  There is no particular advantage to NoSQL until you reach scales that require it.  In fact it is the opposite, given Point 1.
Point 3:  If you are fortunate enough to need the scaling, you will have the time to migrate to NoSQL and it isn’t that expensive or painful to do so when the time comes.

(Full Story: NoSQL is a Premature Optimization)

GitHub: Kundera – a JPA compliant Object-Datastore Mapping Library for NoSQL Datastores

Overview=========The idea behind Kundera is to make working with NoSQL Databases drop-dead simple and fun. Kundera is being developed with following objectives:
o To make working with NoSQL as simple as working with SQLo To serve as JPA Compliant mapping solution for NoSQL Datastores.o To help developers, forget the complexity of NoSQL stores and focus on Domain Model.o To make switching across data-stores as easy as changing a configuration.

(Full Story: GitHub: Kundera – a JPA compliant Object-Datastore Mapping Library for NoSQL Datastores)

neo4j – open source nosql Graph Database

Neo4j is a high-performance graph engine with all the features of a mature and robust database. The programmer works with an object-oriented, flexible network structure rather than with strict and static tables — yet enjoys all the benefits of a fully transactional, enterprise-strength database. For many applications, Neo4j offers performance improvements on the order of 1000x or more compared to relational DBs.

(Full Story: neo4j – open source nosql Graph Database)

Cassandra Data Model with Example

Cassandra key/value are basically stored in aCassandra column orCassandra super column

(Full Story: Cassandra Data Model with Example)

MongoDB Powering MTV’s Web Properties

Right now MongoDB is powering the recently re-launched spike.com, and we are going to be rolling it out on many other major sites within the next year, most likely including gametrailers.com, thedailyshow.com, comedycentral.com, nick.com, and numerous international properties.

(Full Story: MongoDB Powering MTV’s Web Properties)

Follow

Get every new post delivered to your Inbox.