What's new?

Data Mining
CS 510 (DM)
Winter,2004
home | news | site map
review | project | subject | group
weka | mining | gawk | bash
modeling | reference | pods
Display: big | small

Why all the scripting?

Sun Feb 15 2004

The perfect icon for treatment learning: the bumblebee bat http://www.cs.pdx.edu/~timm/dm/bumblebee.html

Wed Feb 11 2004

Week 5 questions available http://www.cs.pdx.edu/~timm/dm/review.html#week5

Sun Feb 08 2004

A naive bayes classifier http://www.cs.pdx.edu/~timm/dm/nbc.html

Wed Feb 04 2004

Detailed log of some real world data mining http://www.cs.pdx.edu/~timm/dm/inspectlog.html

Tues Feb 03 2004

zapblanks: remove empty columns and rows http://www.cs.pdx.edu/~timm/dm/zapblanks.html

Week 4 review questions: http://www.cs.pdx.edu/~timm/dm/review.html#week4

Dos2Unix file conversion: http://www.cs.pdx.edu/~timm/dm/dos2unix.html

Mon Feb 02 2004

Treatment learning: http://www.cs.pdx.edu/~timm/dm/rx.html

Find stable treatments in the outpt of a cross-val study: http://www.cs.pdx.edu/~timm/dm/stable.html

Fri Jan 27 2004

Extracting ranges from a file: http://www.cs.pdx.edu/~timm/dm/ranges.html

Discretization example: http://www.cs.pdx.edu/~timm/dm/nbins.html

Wed Jan 26 2004

Project details weeks 5,6,7 available from http://www.cs.pdx.edu/~timm/dm/project.html

Tues Jan 25 2004

Generate train and test sets http://www.cs.pdx.edu/~timm/dm/traintest.html

randomarff:: generate randomly sorted arff file http://www.cs.pdx.edu/~timm/dm/randomarff.html

Review questions from week 3: http://www.cs.pdx.edu/~timm/dm/review.html#week3

One-line WEKA calls http://www.cs.pdx.edu/~timm/dm/wekatools.html

Scientific Graphics in Pod (using Gnuplot) http://www.cs.pdx.edu/~timm/dm/gnuplot.html

Mon Jan 24 2004

cocomo: software cost estimation http://www.cs.pdx.edu/~timm/dm/cocomo

cocomo expert: software risk estimation http://www.cs.pdx.edu/~timm/dm/cocomoexpert.html

stuff: batch up all the site maintenance stuff http://www.cs.pdx.edu/~timm/dm/stuff

Why all the scripting? http://www.cs.pdx.edu/~timm/dm/gawk4dm.html

GAWK and teaching http://www.cs.pdx.edu/~timm/dm/gawk4teaching.html

Sun Jan 24 2004

GAWK and AI: http://www.cs.pdx.edu/~timm/dm/gawk4ai.html

Simple assumption management http://www.cs.pdx.edu/~timm/dm/assume.html.

Importing tables into AWK http://www.cs.pdx.edu/~timm/dm/readTableEg.html.

Sat Jan 24 2004

Site map added http://www.cs.pdx.edu/~timm/dm/sitemap.html.

Auto-build a site map: http://www.cs.pdx.edu/~timm/dm/makemap.html.

Representing the site map of this site: http://www.cs.pdx.edu/~timm/dm/sitemap.sh.

Building zip files for the files referenced by our pod files: http://www.cs.pdx.edu/~timm/dm/zips.html.

Extracting referenced files from pod files: http://www.cs.pdx.edu/~timm/dm/files.html.

Fri Jan 23 2004

A library of useful gawk functions: http://www.cs.pdx.edu/~timm/dm/lib.html

Thurs Jan 22 2004

Search engine added to all pages (note: indexes only update every week).

Advanced search page added: http://www.cs.pdx.edu/~timm/dm/search.html

Tutorial on different kinds of learners: http://www.cs.pdx.edu/~timm/dm/kinds.html

Wed Jan 21 2004

AWK compared to other languages (measured in terms of lines of code/ function point): see http://www.cs.pdx.edu/~timm/dm/gawk101.html; and http://www.cs.pdx.edu/~timm/dm/functionpoints.html;

Reports/print-outs are due Thursday THIS WEEK and Wednesday all weeks after this.

Changes to http://www.cs.pdx.edu/~timm/dm/index.html#assessment:

Tue Jan 20 2004

EXAM:
Week 2 review questions available: http://www.cs.pdx.edu/~timm/dm/review.html#week2

POD:
Context menu added top-right of each page.

BASH:
Simple frequency counting: http://www.cs.pdx.edu/~timm/dm/freqx.html

Simple histograms: http://www.cs.pdx.edu/~timm/dm/bars.html

Percentile chops: http://www.cs.pdx.edu/~timm/dm/percentile.html

Template for bash applications: http://www.cs.pdx.edu/~timm/dm/template.html

Mon Jan 19, 2004

Notes on advanced Pod-ding http://www.cs.pdx.edu/~timm/dm/site.html

Wed Jan 7 2003

Project specification released: http://www.cs.pdx.edu/~timm/dm/project.html ;

Lectures notes available: http://www.cs.pdx.edu/~timm/dm/index.html#assessment ;

Notes on use the WEKA from the command line: http://www.cs.pdx.edu/~timm/dm/commandlineweka.html .

[TOP]


Credits

Author

Tim Menzies , tim@menzies.us, http://menzies.us

Software

This page generated by Site: see http://www.cs.pdx.edu/~timm/dm/site.html

Acknowledgements

This site is built using PerlPod.

Style sheet switching method taken from Eddie Traversa's excellent and simple-to-apply tutorial: http://dhtmlnirvana.com/content/styleswitch/styleswitch1.html.

Search engine powered by ATOMZ http://www.atomz.com/search/. Note, the indexes to this site are only updated weekly (heh, its a free service- what more ja want?).

Icons on this site come from http://www.sql-news.de/rubriken/olap.asp and http://www.ifnet.it/webif/centrodi/eng/toolbar.htm.

The JAVA machine learners used at this site come from the extensive data mining libraries found in the University of Waikato's Environment for Knowledge Analysis (the WEKA) http://www.cs.waikato.ac.nz/ml/weka/

[TOP]


Legal

Copyright

Copyright (C) Tim Menzies 2004

This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, version 2; see http://www.gnu.org/copyleft/gpl.html. This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details. You should have received a copy of the GNU General Public License along with this program; if not, write to the Free Software Foundation, Inc., 59 Temple Place - Suite 330, Boston, MA 02111-1307, USA.

Disclaimer

The content from or through this web page are provided 'as is' and the author makes no warranties or representations regarding the accuracy or completeness of the information. Your use of this web page and information is at your own risk. You assume full responsibility and risk of loss resulting from the use of this web page or information. If your use of materials from this page results in the need for servicing, repair or correction of equipment, you assume any costs thereof. Follow all external links at your own risk and liability.

[TOP]