Hadoop初识
一,前言
Hadoop 官网: http://hadoop.apache.org/
What Is Apache Hadoop?
The Apache™ Hadoop® project develops open-source software for reliable, scalable, distributed computing.
二,详情
1,Hadoop名字是怎么来的?
2,Hadoop的来源?
3,Hadoop 到底是个什么的东西?
The Apache™ Hadoop® project develops open-source software for reliable, scalable, distributed computing.
4,Hadoop能干什么?
5,Hadoop有什么?
The project includes these modules:
- Hadoop Common: The common utilities that support the other Hadoop modules.
- Hadoop Distributed File System (HDFS™): A distributed file system that provides high-throughput access to application data.
- Hadoop YARN: A framework for job scheduling and cluster resource management.
- Hadoop MapReduce: A YARN-based system for parallel processing of large data sets.
三,小结
不论是互联网之际还是未来非常火的机器学习,人工智能基础数据都是根, 而hadoop, spark,strom这些正是分析数据好工具,深入了解这个,对这个工具的熟悉应用又是这个时代it工作者的一项基本功,在这条路上还需深入学习,探索和站在巨人的肩膀上去成长.