基于云计算的数据挖掘平台架构及其关键技术研究

发布时间:2013-02-17 作者:丁岩,杨庆平,钱煜明 阅读量:

[摘要] 随着云计算时代的到来,传统数据挖掘系统在海量数据的分析挖掘方面存在性能瓶颈。文章提出了基于云计算的数据挖掘平台,该平台与传统的数据挖掘系统架构相比有高可扩展性、海量数据处理能力、面向服务、硬件成本低廉等优越性,可以支持大范围分布式数据挖掘的设计和应用。该平台能极大减少运营商、企业在数据挖掘技术上的投入并能加快其挖掘业务的推出,缩短研发周期,进一步提高产品收益。

[关键词] 数据挖掘平台;云计算; 数据挖掘云;海量数据

[Abstract] There are performance bottlenecks and scalability problems when a traditional data-mining system is used in cloud computing. In this paper, we present a data-mining platform based on cloud computing. Compared with a traditional data mining system, this platform is highly scalable, has massive data processing capacity, is service-oriented, and has low hardware cost. This platform can support the design and applications of a wide range of distributed data-mining systems. It can greatly decrease the amount of investment needed by telecom operators and enterprises on data mining technologies. It can also shorten the development cycle, speed up the launch of mining services, and improve product revenue.

[Keywords] data mining platform; cloud computing; the cloud of data mining; massive data