|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
![]() |
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Display:
Description:
The parametrizing window has two tabs:
The choice of quantification is made according to the type of data input. Users who are unfamiliar with this operation will very quickly learn to recognize the correct type of quantification, especially with the help of the box and whiskers presented in the Drawings tab. The one selected by default is quantiles but it is not automatically the best. The class number can be a figure between 1 and 512, except during calculation of recursive averages, for
which the number of classes must be a power of 2 (2, 4, 8, 16, 32... 512). ATTENTION:
generally, the number of classes should not be more than 12, to enable clear representation in colors or qualitative
symbols.
The standard deviation factor allow to choose a multiplying factor which will modify the class width for the standard and standard zero centered quantifications. For quantification by custom thresholds, the thresholds can be
The button representing an arrow, which is to be found under the threshold table, enables values to be transferred from the automatic thresholds to the custom thresholds column. The text area situated below the Parameters tab contains the Jenks and Tai indicators, the
inter-group and intra-group variances, the number of elements contained in each class, and a statistical
summary of the data input. See the paragraph on indicators below.
The Drawings tab, a copy of whose display can be seen above, contains :
When the mouse pointer is moved over the graphic area, the value under the pointer is displayed in the Data value zone located under the graphic area. For quantification by custom thresholds, gray vertical bars representing the thresholds appear on the graphic
area. You can move them thanks to the mouse to set manually the thresholds with the help of the drawings. Their moving
is only allowed between the minimum and maximum values of the data.
The Center button enables the graphs to be re-centered if you have moved the drawings using the scroll bars. If the window size is changed, the drawings are automatically adjusted to the new size. Drawings :
Types of data:
On the other hand, if the module already has output connections, its input type is fixed. To connect a matrix data input, it must first be disconnected from its sons. Formulas used for the indicators: Jenks' indicator (from its author's name) : deviation of a class = absolute value of:
Jenks' indicator = (sum total of the deviations of each class) / number of classes Tai indicator (Tabular Accuracy Index from Jenks also) : Distance1 = sum total of the distances between the values and the average of the class in which they are found
Tai indicator = 1 - Distance1 / Distance2 The closer the Tai indicator is to one, the better the quantification.
Inter-group variance: This indicator shows whether the classes obtained are similar or different. Distance = sum total of the squares of the distances between the average of each class and the overall average of the data multiplied by the number of data inputs in the class Inter-group variance = Distance / (number of classes - 1) Intra-group variance: This indicator shows whether the classes obtained are homogeneous or heterogeneous. Distance = sum total of the squares of the distances between the values and the average of the class in which they are to be found Intra-group variance = Distance / (number of data inputs - number of classes) When comparing different quantifications, look carefully at these variances. You should try to increase the inter-group variance, and reduce the intra-group variance. List of quantifications: Standard:
With a same value for the standard deviation factor, this quantification gives rise to only two types of
distribution:
In fact, for example for 3, 5, 7, etc.classes with a standard deviation factor equal to 1, the central class is always centered on the average, and always has a width of one standard deviation. This type of quantification is not suitable for relative data; for example, if you are mapping a percentage, the average of the data is not the average of the percentages. Standard zero centered:
Quantiles:
Equal sizes:
Recursive averages:
This type of quantification is not suitable for relative data; for example, if you are mapping a percentage, the average of the data is not the average of the percentages. Jenks:
Custom:
Principle shared by quantifications (except quantiles):
Attention: the thresholds are stored using simple precision; this means that quantification cannot distinguish between values whose first eight figures are identical. Script : 2 module untyped_list "" 3 mod_type integer "103" 3 mod_subtype integer "502" 3 mod_name string "Discretisation" 3 mod_dads integer_list "" 4 ? integer "4" 3 work_on_matrix boolean "F" 3 quant_type integer "802" 3 class_nb integer "4" 3 stddur_fact double "1" 3 user_thresholds double_list "" 4 ? double "0" 4 ? double "0" 4 ? double "0" 4 ? double "0" 4 ? double "0" 3 auto_thresholds double_list "" 4 ? double "17" 4 ? double "45" 4 ? double "65" 4 ? double "78" 4 ? double "95" 3 classes_count integer_list "" 4 ? integer "4" 4 ? integer "4" 4 ? integer "4" 4 ? integer "3" 3 jenks double "0.013676485" 3 tai double "0.63260762" 3 inter_grp double "2054.4944" 3 intra_grp double "75.113636" Values for quant_type: Custom 801 Quantiles 802 Equal sizes 803 Recursive averages 804 Standard 805 Standard centered on zero 806 Jenks 807 In the list of automatic thresholds, we can see that the minimum and maximum of the values are also in the list. These values are obligatory, but they are recalculated by the module. It is thus possible to add zeros. Samples
|