DTREG
  • DTREG is the ideal tool for modeling business and medical data with categorical variables such as sex, race and marital status.

  • Decision trees present a clear, logical model that can be understood easily by people who are not mathematically inclined.

  • If you have a need for linear or nonlinear regression analysis, check out the NLREG program.

  • You also should check out the News Rover program that automatically scans Usenet newsgroups, downloads messages of interest to you, decodes binary file attachments, reconstructs files split across multiple messages, and eliminates spam and duplicate files. News Rover also has a built-in MP3 music search engine and can quickly locate music files on any Usenet newsgroup.

    DTREG Benchmarks


    The following table shows the results for various types of predictive models applied to a large number of benchmarks. Note that different types of models work best for different types of data.

    The following predictive models were tested using DTREG:

    • Single tree - Classical single decision trees.
    • TreeBoost - DTREG implementation of Jerome Friedman's Stochastic Gradient Boosting ("MART").
    • Decision Tree Forests - DTREG implementation of Leo Breiman's "Random Forest"(tm) algorithm.
    • SVM - Support Vector Machine.
    • LDA - Linear Discriminant Analysis.
    • Logistic Regression - Regression adapted for binary classifications.
    • PNN - Probabilistic Neural Network.
    • CCNN - Cascade Correlation Neural Network.
    The numbers in the table are the percent misclassification (error) for each method on each benchmark. The misclassification percentages were computed using 10-fold cross-validation for all methods except for Decision Tree Forests which used Out-Of-Bag (OOB) validation and Probabilistic Neural Networks which used Leave-One-Out (LOO) validation.

    Logistic regression could not be performed for benchmark problems that have more than two target categories.

    Many of the benchmark data sets came from the UCI Machine Learning Repository.

    The best result for each benchmark is shown in bold face type. The worst result is in italics.

    Percent Misclassification Using Validation Data
    Benchmark Single tree TreeBoost Tree forest SVM LDA Logistic reg. PNN CCNN
    Abalone 46 45 45 45 45   44 44
    Ad 3 3 3 3 3   2 3
    AdultCensus 20 14 15 20 16   13 15
    Anneal steel 11 5 23 11 18   6 9
    Argentina 27 14 24 32 24 28 16 20
    Astroparticle 4 2 3 3 10 5 3 3
    Audiology 61 19 36 15 25   9 20
    AustralianCrabSex 14 8 7 3 4 4 4 2
    AustralianCredit 15 13 13 24 15 15 12 14
    Balance 22 15 18 0 14   10 4
    Bands 29 19 20 25 26   7 24
    Bioinformatics 38 18 21 16 17 21 16 18
    Bisbey 19 10 12 18 18   14 16
    Bridges 30 24 36 57 33   25 24
    Car 4 1 4 0 10   13 2
    ChurchParticipation 63 56 58 55 49   47 49
    ClevelandHeart14 58 46 44 48 40   39 41
    ColonTumor 19 8 16 20 23   10 24
    Contraception 44 44 44 46 49   45 45
    CreditApplication 14 9 13 14 14   12 14
    DeathPenalty 31 25 28 31 25 26 7 24
    Dermatology 4 3 3 0 3   0 4
    DNA 7 4 30 4 6   4 5
    Ecoli 22 14 14 18 13   12 13
    ElectroCardiogram 26 23 24 25 26 27 22 23
    EvansCounty 12 11 11 25 10 10 10 10
    Federalist 5 5 5 4 15 12 0 6
    Flags 45 32 46 36 36   25 44
    Fraud 26 29 31 45 29 29 14 33
    GermanCredit 33 24 25 23 24 24 20 24
    GfaNormaux 6 5 6 4 8 8 2 6
    Glass 30 24 22 34 36   12 34
    GymTutor 7 4 10 4 11   2 4
    Haberman 30 33 39 33 25 25 24 25
    Hayes-Roth 15 20 33 23 33   16 26
    Heart13 46 21 16 16 21 25 14 13
    Hepatitis 21 15 21 21 15 20 6 15
    HorseColic 20 17 18 16 18   7 21
    HOSLEM 50 33 33 34 31 33 25 31
    HouseVotes 5 4 3 3 4 3 2 3
    InsuranceFraud 27 18 31 29 28   5 19
    Ionosphere 9 7 6 5 14 13 8 10
    Iris 5 3 3 3 2   3 4
    Labor-neg 12 10 5 7 10 10 0 12
    Lenses 25 17 42 12 12   4 21
    Letter-recognition 14 4 0 2 30   2 20
    LibSvmVehicle 25 16 16 20 18   17 17
    LiverDisorder 32 29 26 29 30 34 30 29
    LowBwt 36 34 34 35 31 36 30 29
    LungCancer 50 50 87 47 50   6 56
    Lymphography 22 14 22 17 21   5 16
    Marketing 53 48 51 50 47   45 44
    Microchip 33 35 38 38 34 34 34 34
    Mushrooms 0 0 0 0 0   0 0
    Musk 24 10 10 5 18 18 4 13
    NLS 42 33 31 31 30   29 29
    Nursery 2 0 45 0 47   2 0
    NursingHome 20 7 6 16 6 16 4 5
    OilSpill 12 7 18 10 3   3 3
    Optdigits 10 2 2 1 4   1 3
    Pageblocks 8 5 2 8 5   3 3
    PenDigits 4 2 1 0 11   1 2
    P.I.-Diabetes 25 24 26 24 23 26 16 22
    PostOperative 58 42 43 51 38   29 37
    PrimaryTumor 74 59 68 60 55   55 52
    Reuters 11 5 4 3 23   11 7
    RingNorm 13 2 4 1 23 24 49 7
    SalesPlan 65 59 61 56 61   63 59
    Satellite 15 8 8 8 16   8 12
    SatImage 13 8 8 7 16   10 12
    Segment 5 2 2 0 8   3 5
    Shuttle 1 1 1 0 6   0 0
    Smoking 65 49 41 66 32   31 32
    Sonar 24 13 13 13 24 26 1 26
    SpamBase 7 6 5 6 10 7 9 7
    Spectf 27 17 20 21 41 39 6 22
    Splice DNA 5 4 35 3 5   3 5
    Spiral 47 42 47 8 51 51 24 57
    SvmTumor 46 25 24 24 33   60 33
    Tae 51 47 61 51 50   48 54
    Thyroid (ANN) 2 2 3 3 6   2 1
    Tic-tac-toe 6 1 1 0 2 2 2 2
    Tin 49 29 28 34 26 26 24 26
    Titanic 21 24 22 21 22 22 21 21
    TorchClassif 16 9 7 9 27 27 3 13
    Twonorm 15 2 3 2 2 2 2 2
    UTI 69 78 85 71 62   64 64
    Vehicle 28 24 24 14 22   25 22
    Vibration 60 60 60 61 51   48 49
    Vowel 17 7 3 2 28   0 11
    Waveform 22 15 15 13 14   15 13
    wbdc 7 5 6 4 7 7 6 7
    Wine 8 3 2 1 2   0 2
    Zoo 12 7 7 4 9   1 4
    Average error 24.80 18.61 22.19 19.79 22.21 20.42 15.38 19.26
    Median error 21.00 14.00 18.00 16.00 21.00 23.00 10.00 15.50
    Num. times best 4 14 8 25 5 2 53 16
    Num. times worst 42 2 17 14 24 8 3 9



    Return to DTREG home page

    Download demonstration copy of DTREG.

    Download manual for DTREG.

    Download manual for DTREG COM Library.

    Purchase DTREG.