Wednesday, 13 November 2013

Terms you come across in Apache Hadoop



Apache Hadoop Terms
                New terms you will run into when getting started with Apache Hadoop. See the list below. If any term is missing, please add it in a comment.
                HDFS  -  Hadoop Distributed File System
                AWS   -  Amazon Web Services
                HPC   -  High-Performance Computing
                IDC   -  International Data Corporation
                CDH   -  Cloudera's Distribution Including Apache Hadoop
                GFS   -  Google File System
                RPM   -  Red Hat Package Manager
                UDF   -  User-Defined Function
                Big data
                Impala
                MapReduce (Map and Reduce)
                Quest
                Cluster
                Node
                Metadata
                ETL (Extract, Transform, Load)
                Key-value pairs
                MapReduce algorithms: sorting, searching, indexing, joining data sets, etc.
                NameNode, DataNode
                TaskTracker, JobTracker
                Terabyte, Petabyte

Apache Ambari, Hive, Pig, HCluster, HBase,
YARN, Sqoop, Oozie, Flume,
Avro, Chukwa, Cassandra, ZooKeeper,
Mahout, Talend, WebHDFS, Whirr,
Hue, Tez, Splunk, Bigtop,
Riak, MongoDB
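
To make the MapReduce and key-value pair terms a little more concrete, here is a minimal word-count sketch in plain Python. It does not use Hadoop or any Hadoop API; the function names (map_phase, shuffle, reduce_phase, word_count) are made up for illustration, and everything runs in a single process rather than on a cluster.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) key-value pair for every word in the line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle: group all values by key, as a framework would between phases."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: combine all values collected for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence (hypothetical helper)."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["Hadoop stores data", "Hadoop processes data"]))
```

On a real cluster the map and reduce steps would run in parallel on different nodes, and the framework would handle the grouping step in between.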
