International Journal of Biomedical Data Mining

Open Access

ISSN: 2090-4924

Abstract

Implementation of Decision Tree Using Hadoop MapReduce

Tianyi Yang and Anne Hee Hiong Ngu

Hadoop is one of the most popular general-purpose computing platforms for the distributed processing of big data. HDFS is Hadoop's implementation of a distributed file system; it stores very large amounts of data reliably while serving that data to Hadoop's processing components. MapReduce is the main processing engine of Hadoop. In this study, we used HDFS and MapReduce to implement a well-known learning algorithm, the decision tree, in a way that scales to large input problem sizes. Computational performance is evaluated as a function of node count and problem size.
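The abstract does not say how the tree-building work is divided between map and reduce tasks, so the following is only an illustrative sketch of one common arrangement, not the authors' implementation. It assumes categorical attributes and an ID3-style information-gain criterion: the map step emits a count for every (attribute, value, class label) triple, the reduce step sums those counts, and the split for a tree node is chosen from the aggregated totals. The map and reduce phases are simulated in plain Python rather than through the Hadoop API, and all function names (`map_record`, `reduce_counts`, `best_split`) are hypothetical.

```python
import math
from collections import Counter, defaultdict


def map_record(record, label_key):
    """Map step: emit ((attribute, value, label), 1) for each attribute of one record."""
    label = record[label_key]
    for attr, value in record.items():
        if attr != label_key:
            yield (attr, value, label), 1


def reduce_counts(pairs):
    """Reduce step: sum the counts emitted for each (attribute, value, label) key."""
    totals = Counter()
    for key, count in pairs:
        totals[key] += count
    return totals


def entropy(class_counts):
    """Shannon entropy (in bits) of a collection of class counts."""
    counts = [c for c in class_counts if c > 0]
    total = sum(counts)
    return -sum(c / total * math.log2(c / total) for c in counts)


def best_split(records, label_key):
    """Choose the attribute with the highest information gain (assumed criterion)."""
    # Simulated map and reduce phases; on a cluster these would run as distributed tasks.
    pairs = [kv for record in records for kv in map_record(record, label_key)]
    totals = reduce_counts(pairs)

    # Class distribution at the current tree node, computed directly here for brevity.
    base = entropy(Counter(r[label_key] for r in records).values())
    n = len(records)

    # Regroup the reduced counts as attribute -> value -> label counts.
    per_attr = defaultdict(lambda: defaultdict(Counter))
    for (attr, value, label), count in totals.items():
        per_attr[attr][value][label] += count

    gains = {}
    for attr, values in per_attr.items():
        remainder = sum(
            sum(label_counts.values()) / n * entropy(label_counts.values())
            for label_counts in values.values()
        )
        gains[attr] = base - remainder
    return max(gains, key=gains.get), gains
```

Calling `best_split(records, "play")` on a list of dictionaries such as `{"outlook": "sunny", "windy": "no", "play": "no"}` returns the chosen attribute together with the gain computed for every attribute. Only the small table of counts has to reach the reducer, which is why this step can scale with the number of input records.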
