Apache Hadoop Terms
Terms you will come across when getting started with Apache Hadoop. See the list below. If any term
is missing, please add it in a comment.
HDFS - Hadoop Distributed File System
AWS - Amazon Web Services
HPC - High Performance Computing
IDC - International Data Corporation
CDH - Cloudera's Distribution Including Apache Hadoop
GFS - Google File System
RPM - Red Hat Package Manager
UDF - User Defined Functions
Big Data
Impala
MapReduce (Map and Reduce)
Quest
Cluster
Node
Metadata
ETL (Extract, Transform, Load)
Key-value pairs
MapReduce algorithms: sorting, searching, indexing, joining data sets, etc.
NameNode, DataNode
TaskTracker, JobTracker
Terabyte, Petabyte
Apache Ambari, Hive, Pig, HCluster, HBase,
YARN, Sqoop, Oozie, Flume,
Avro, Chukwa, Cassandra, ZooKeeper,
Mahout, Talend, WebHDFS, Whirr,
Hue, Tez,
Splunk, Bigtop,
Riak, MongoDB
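
Several of the terms above (MapReduce, key-value pairs, sorting) are easier to picture with a small example. The following is only a rough sketch in plain Python, not Hadoop code: the function names map_phase, shuffle, reduce_phase and word_count are made up for illustration, and everything runs in one process instead of on a cluster. It counts words by emitting (word, 1) pairs, grouping them by key, and summing each group.

```python
from collections import defaultdict


def map_phase(line):
    """Map step: emit a (word, 1) key-value pair for every word in a line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group all values by key, standing in for the shuffle between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce step: collapse all values for one key into a single total."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence on a list of text lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle(pairs)
    # Keys are sorted before reducing, so results come out in key order.
    return dict(reduce_phase(key, values) for key, values in sorted(grouped.items()))


if __name__ == "__main__":
    print(word_count(["Hadoop stores data", "hadoop processes data"]))
```

In a real Hadoop job the map and reduce steps would run as separate tasks on different nodes, and the framework would handle the grouping and sorting in between.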