The Hadoop Distributed File System (HDFS) is designed to store very large data sets reliably, and to stream those data sets at high bandwidth to user applications. In a large cluster, thousands of servers both host directly attached storage and execute user application tasks. By distributing storage and computation across many servers, the resource can grow with demand while remaining economical at every size. We describe the architecture of HDFS and report on experience using HDFS to manage 25 petabytes of enterprise data at Yahoo!
Showing posts with label hdfs. Show all posts
Wednesday, May 15, 2013
Saturday, May 4, 2013
24 Interview Questions & Answers for Hadoop MapReduce developers
A good understanding of Hadoop architecture is required to understand and leverage the power of Hadoop. Here are a few important practical questions that may be asked of a senior, experienced Hadoop developer in an interview. This list primarily includes questions related to Hadoop architecture, MapReduce, the Hadoop API and the Hadoop Distributed File System (HDFS).
Tuesday, April 30, 2013
Big Data Analytics with Hadoop
A good presentation; it is helpful for everyone from beginners to advanced users...
Sunday, April 28, 2013
Installing Hadoop on Ubuntu (12.04) - single node
--Installing Java
$ sudo add-apt-repository ppa:webupd8team/java
$ sudo apt-get update
$ sudo apt-get install oracle-java7-installer
--Creating user
$ sudo addgroup hadoop
$ sudo adduser --ingroup hadoop hduser
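A quick sanity check after the two steps above can save some debugging later. The snippet below is only a sketch: the helper name have_cmd is made up for illustration, and it assumes the user is called hduser as in the commands above. It reports whether java is on the PATH and which groups hduser belongs to, and needs no sudo.

```shell
# Hypothetical helper (not part of Hadoop or Ubuntu):
# succeeds if the given command can be found on PATH.
have_cmd() {
  command -v "$1" >/dev/null 2>&1
}

# Check the Java install; java -version writes to stderr, so redirect it.
if have_cmd java; then
  java -version 2>&1 | head -n 1
else
  echo "java not found on PATH"
fi

# Check the user; id exits non-zero when the user does not exist.
if id hduser >/dev/null 2>&1; then
  # Lists the groups of hduser; "hadoop" should be among them.
  id -nG hduser
else
  echo "user hduser does not exist"
fi
```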
Monday, April 22, 2013
Hadoop Interview Question
1. What is the Hadoop framework?
Answer:
Hadoop is an open source framework written in Java by the Apache Software Foundation. It is used to write software applications that need to process vast amounts of data (it can handle multiple terabytes of data). It works in parallel on large clusters, which can have thousands of computers (nodes), and it processes data in a reliable, fault-tolerant manner.
2. On what concept does the Hadoop framework work?
Answer:
It works on MapReduce, a programming model devised by Google.
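To get a feel for the MapReduce idea without a cluster, here is a rough analogy using ordinary Unix tools. This is not the Hadoop API, only a sketch of the same three phases on a single machine: a map step that emits one word per line, a shuffle step that sorts so equal keys sit together, and a reduce step that counts each group. The function name wordcount is made up for illustration.

```shell
# Word count as a map / shuffle / reduce pipeline (analogy only, not Hadoop).
wordcount() {
  tr -s '[:space:]' '\n' |        # map: split input into one word per line
    grep -v '^$' |                # drop any empty line left by leading whitespace
    sort |                        # shuffle: bring identical words together
    uniq -c |                     # reduce: count each run of identical words
    awk '{print $2 "\t" $1}'      # format as word<TAB>count
}

printf 'hello hadoop\nhello hdfs\n' | wordcount
# prints:
# hadoop	1
# hdfs	1
# hello	2
```

In Hadoop the same phases run in parallel across many nodes, with the framework handling the sort and the movement of data between the map and reduce tasks.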
3. What is MapReduce?