Thursday 31 October 2013

What is HADOOP ( HDFS and MapReduce)

HADOOP is a software framework that was inspired by Google's Map Reduce and Google File System and now is considered as best solution which can deal with BigData.

When we talk about Big data, it can be anything in the form of picture, movie etc ...and consumes huge amount of space

In Hadoop the storage is provided by HDFS-it provides good way of storage to prevent loss of data in case of failure, and analysis by Map Reduce(data processing) using its own adhoc analysis and runs the query against a huge data and shows the result in a reasonable amount of time.

Saturday 19 October 2013

ADVANTAGES OF CLOUD COMPUTING

Cloud computing offers numerous Advantages .Few of them are given below:

  • Easy access to the information with lower initial investment  since we don’t have to spend much on the hardware, software or licensing fees  and more throughput since everyone(multiple staff can access, share folders and files) can access the cloud through internet  on standard web browsers anytime and can login from anywhere provided they have the required bandwidth.

  • Instead of installing software on our computer directly cloud allows to access files and soft wares through internet. Both small and big companies can benefit from the cloud approach.
 

Tuesday 15 October 2013

Features of Mongo DB

Mongo DB is one of newest database introduced by 10gen .Mongo DB is an open source, on the whole a document oriented Database system and is a part of NoSQL family of database. Despite the fact that it’s not a relational database it has some of the imperative features of RDBMS and has got implausible speed. This DB is used in Projects like Unique Identification Authority of India (UIDAI) , MTV networks and many others

Instead of storing data in tables here rows are replaced by Documents (basic unit of Data in Mongo DB just like a ROW in RDMS) and Collections (collection is a group of documents.) which allow representing complex relationships. It can manage huge amount of data and can load data across a cluster. Mongo DB can perform some features which relational database cannot do.

Below are some of the Features of Mongo DB:
  • Mongo DB supports Map reduce and Aggregation Tools
  • Java Scripts are used instead of Procedures
  • Mongo DB is a schema less Database
  • Most Importantly Mongo DB supports secondary indexes and geospatial indexes.
  • Simple to Administer the Mongo DB in cases of failures
  • Mongo DB designed to provide High Performance
  • MongoDB stores files of any size without complicating your stack.

If you like this post, please share it on google by clicking on the Google +1 button.


Please go through our latest post TOP 6 BIG DATA TRENDS IN THE NEAR FUTURE

Thursday 10 October 2013

Informatica Scenarios:Pivoting of records(Pivoting of Employees by Department)

In this scenario we will discuss about how to pivot a set of records based on column:
Source Records: 
Dept_id
Emp_name
10
CLARK
10
KING
10
MILLER
20
JONES
20
FORD
20
ADAMS
20
SMITH
20
SCOTT
30
WARD
30
TURNER
30
ALLEN
30
BLAKE
30
MARTIN
30
JAMES

Expected Output
DEPT_ID
EMP_NAME
10
CLARK|KING|MILLER
20
JONES|FORD|ADAMS|SMITH|SCOTT
30
WARD|TURNER|ALLEN|JAMES|BLAKE|MARTIN

Related Posts Plugin for WordPress, Blogger...

ShareThis