Format of the Training Set
For each loaded file (either
training or test set) there needs to be a description file in the same
directory. While the training set has the suffix ".train" (whole file f.ex.
"chess.train") and the test set the suffix ".test" (chess.test), the description
file must have the suffix ".desc" (chess.desc).
Format of the
training and test set:
-
Each row contains one record
-
All attributes are separated by "/t".
-
The class attribute is the first attribute
and must be a number
Format of the description
file:
-
It consists of blocks
with a preceding block identifier.
-
The attribute block
contains the attribute names and their type (either "continuous" or "categorical").
The name and the type is separated by a " ".
-
The classes block
assignes class names to the numbers used in the test and training set. They
are separated by a "=".
For example:
attributes:
Attribute1 continuous
Attribute2 continuous
Attribute3 continuous
classes:
1=Rad Flow
2=Fpv Close