Blog信息 |
|
blog名称:IDMer (数据挖掘者) 日志总数:175 评论数量:848 留言数量:119 访问次数:2508287 建立时间:2005年6月24日 |
我的相册 |
|

|
联系方式 |
 |
|
| |
公告 |
“数据挖掘者”博客已经搬家,欢迎光临新博客网址:http://idmer.blog.sohu.com 我的新浪微博:@张磊IDMer |
网络日志 |
|
C4.5 Tutorial |
|
|
|
|
|
数据挖掘者 发表于 2006/3/15 20:28:20 |
|
|
|
转载:http://www2.cs.uregina.ca/~dbd/cs831/notes/ml/dtrees/c4.5/tutorial.htmlC4.5是决策树的经典算法,以上链接中对该算法进行了详细描述,并且给出了源程序和应用示例。
References:
P. Winston, 1992.
C4.5 is a software extension of the basic ID3 algorithm designed by Quinlan to address the following issues not dealt with by ID3:
Avoiding overfitting the data
Determining how deeply to grow a decision tree.
Reduced error pruning.
Rule post-pruning.
Handling continuous attributes.
e.g., temperature
Choosing an appropriate attribute selection measure.
Handling training data with missing attribute values.
Handling attributes with differing costs.
Improving computational efficiency.
It is installed for use on Grendel (grendel.icd.uregina.ca), but it may be set up on a local machine as follows:
C4.5 Release 8 Installation Instructions for UNIX
Download the C4.5 source code.
Decompress the archive:
Type "tar xvzf c4.5r8.tar" (not universally supported), or, alternatively,
Type "gunzip c4.5r8.tar.gz" to decompress the gzip archive, and thenType "tar xvf c4.5r8.tar" to decompress the tar archive.
Change to ./R8/Src
Type "make all" to compile the executables.
Put the executables into a "bin" subdirectory and include it in the path for command-line usage.
Manual Pages
c4.5: using the c4.5 decision tree generator.
verbose c4.5: interpreting output generated by c4.5.
c4.5rules: using the c4.5 rule generator.
verbose c4.5rules: interpreting output generated by c4.5rules.
consult: uses a decision tree to classify items.
consultr: uses a rule set to classify items.
Examples
Click on the links below for examples of C4.5 usage:
Example 1 - Golf
A simple, detailed example of how C4.5 and C4.5rules work.
Example 2 - Sunburn
The sunburn example revisited.
Example 3 - Homonyms
Advanced usage of, and a practical application of, C4.5 and C4.5rules. |
|
|
|
阅读全文(22317) | 回复(9) | 编辑 | 精华 |
|
|
|
|
|
|
|
回复:C4.5 Tutorial |
|
|
|
|
|
test(游客)发表评论于2006/3/19 8:04:42 |
|
|
|
感谢对这个看不懂id3,c4.5算法的人的强力帮助和快速回复!如果在半年前看到您这个网站的内容该多好!
还有一个疑惑,对于我这样的初学者(计算机科学教育背景,有编程经验,英语还不错),是中文的资料好呢,还是英文的资料好。适合学习,上手,能够快速学到一定深度(能写开题报告就好:)
赫赫,请多指点!从你这里得到的好东东,比dmresearch都好!看英文的网络资源却一直没有理出个头绪,发现哪些资源好。
看了你的常去网站,非常经典,太有用了,多谢!你参与的dwway我觉得就比dmresearch对我有用,dmresearch东西虽多,不知道是杂还是什么原因,反正用不上,也没学会什么。 |
|
|
|
个人主页 | 引用回复 | 主人回复 | 返回 | 编辑 | 删除 |
|
|
|
|
|
|
| |