fluid.metrics
Accuracy
-
class paddle.fluid.metrics.Accuracy(name=None)
Calculate the mean accuracy over multiple batches. See https://en.wikipedia.org/wiki/Accuracy_and_precision
- Parameters
name – the name of the metric
Examples
import paddle.fluid as fluid
# suppose we have batch_size = 128
batch_size = 128
accuracy_manager = fluid.metrics.Accuracy()
# suppose the accuracy is 0.9 for the 1st batch
batch1_acc = 0.9
accuracy_manager.update(value=batch1_acc, weight=batch_size)
print("expect accuracy: %.2f, get accuracy: %.2f" % (batch1_acc, accuracy_manager.eval()))
# suppose the accuracy is 0.8 for the 2nd batch
batch2_acc = 0.8
accuracy_manager.update(value=batch2_acc, weight=batch_size)
# the joint acc for batch1 and batch2 is (batch1_acc * batch_size + batch2_acc * batch_size) / batch_size / 2
print("expect accuracy: %.2f, get accuracy: %.2f" % ((batch1_acc * batch_size + batch2_acc * batch_size) / batch_size / 2, accuracy_manager.eval()))
# reset the accuracy_manager
accuracy_manager.reset()
# suppose the accuracy is 0.8 for the 3rd batch
batch3_acc = 0.8
accuracy_manager.update(value=batch3_acc, weight=batch_size)
print("expect accuracy: %.2f, get accuracy: %.2f" % (batch3_acc, accuracy_manager.eval()))
-
update(value, weight)
Update the states with the accuracy of one minibatch.
- Parameters
value (float|numpy.array) – accuracy of one minibatch.
weight (int|float) – batch size.
-
eval()
Return the mean accuracy (float or numpy.array) over all accumulated batches.
-
get_config()
Get the metric and its current states. The states are the members that do not have a “_” prefix.
- Parameters
None –
- Returns
a dict of metric and states
- Return type
dict
-
reset()
Clear the states of the metric. By default, the states are the members that do not have a “_” prefix; reset sets them to their initial states. If you do not follow this implicit naming rule, please also customize the reset interface.
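The accumulation that update and eval describe amounts to a weighted running mean. The following is a minimal pure-Python sketch of that arithmetic under this assumption, not the library's implementation; the class name WeightedMean is hypothetical.

```python
class WeightedMean:
    """Hypothetical stand-in that illustrates weighted accuracy accumulation."""

    def __init__(self):
        self.value = 0.0   # running sum of accuracy * weight
        self.weight = 0.0  # running sum of weights (e.g. batch sizes)

    def update(self, value, weight):
        # Each minibatch contributes its accuracy scaled by its size.
        self.value += value * weight
        self.weight += weight

    def eval(self):
        if self.weight == 0:
            raise ValueError("no batches have been accumulated")
        return self.value / self.weight

    def reset(self):
        self.value = 0.0
        self.weight = 0.0


m = WeightedMean()
m.update(value=0.9, weight=128)
m.update(value=0.8, weight=128)
mean_acc = m.eval()  # (0.9 * 128 + 0.8 * 128) / 256

# With unequal batch sizes the larger batch dominates the mean.
m.reset()
m.update(value=1.0, weight=300)
m.update(value=0.0, weight=100)
uneven_acc = m.eval()
```

Weighting by batch size keeps the result equal to the accuracy over all samples even when the last batch of an epoch is smaller than the others.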
Auc
-
class paddle.fluid.metrics.Auc(name, curve='ROC', num_thresholds=4095)
The AUC metric is for binary classification. Refer to https://en.wikipedia.org/wiki/Receiver_operating_characteristic#Area_under_the_curve for details. Please note that this AUC metric is implemented in Python, which may be a little slow. If speed is a concern, please use fluid.layers.auc instead.
The auc function creates four local variables, true_positives, true_negatives, false_positives and false_negatives, that are used to compute the AUC. To discretize the AUC curve, a linearly spaced set of thresholds is used to compute pairs of recall and precision values. The area under the ROC curve is therefore computed using the height of the recall values by the false positive rate, while the area under the PR curve is computed using the height of the precision values by the recall.
- Parameters
name – metric name
curve – Specifies the name of the curve to be computed, ‘ROC’ [default] or ‘PR’ for the Precision-Recall-curve.
NOTE: only the ROC curve type is currently implemented in Python.
Examples
import paddle.fluid as fluid
import numpy as np
# init the auc metric
auc_metric = fluid.metrics.Auc("ROC")
# suppose that batch_size is 128
batch_num = 100
batch_size = 128
for batch_id in range(batch_num):
    class0_preds = np.random.random(size=(batch_size, 1))
    class1_preds = 1 - class0_preds
    preds = np.concatenate((class0_preds, class1_preds), axis=1)
    labels = np.random.randint(2, size=(batch_size, 1))
    auc_metric.update(preds=preds, labels=labels)
    # should be a score close to 0.5, as the preds are randomly assigned
    print("auc for iteration %d is %.2f" % (batch_id, auc_metric.eval()))
-
update(preds, labels)
Update the AUC curve with the given predictions and labels.
- Parameters
preds – a numpy array of shape (batch_size, 2); preds[i][j] denotes the probability of classifying instance i into class j.
labels – a numpy array of shape (batch_size, 1); labels[i] is either 0 or 1, representing the label of instance i.
-
eval()
Return the area under the curve (a float score).
-
get_config()
Get the metric and its current states. The states are the members that do not have a “_” prefix.
- Parameters
None –
- Returns
a dict of metric and states
- Return type
dict
-
reset()
Clear the states of the metric. By default, the states are the members that do not have a “_” prefix; reset sets them to their initial states. If you do not follow this implicit naming rule, please also customize the reset interface.
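One common way to discretize the ROC curve with linearly spaced thresholds is to bucket the positive-class scores and sum one trapezoid per bucket. The sketch below illustrates that idea in pure Python; it is an assumption about the general technique, not a copy of the library code, and the function name roc_auc_bucketed is hypothetical. Scores are assumed to lie in [0, 1].

```python
def roc_auc_bucketed(scores, labels, num_thresholds=4095):
    """Approximate the ROC AUC by bucketing positive-class scores in [0, 1]."""
    pos = [0] * (num_thresholds + 1)  # positives per score bucket
    neg = [0] * (num_thresholds + 1)  # negatives per score bucket
    for score, label in zip(scores, labels):
        idx = int(score * num_thresholds)
        if label:
            pos[idx] += 1
        else:
            neg[idx] += 1

    tot_pos = tot_neg = 0
    area = 0.0
    # Sweep thresholds from high to low, adding one trapezoid per bucket:
    # width = new false positives, height = mean of old and new true positives.
    for idx in range(num_thresholds, -1, -1):
        prev_pos, prev_neg = tot_pos, tot_neg
        tot_pos += pos[idx]
        tot_neg += neg[idx]
        area += (tot_neg - prev_neg) * (tot_pos + prev_pos) / 2.0

    if tot_pos == 0 or tot_neg == 0:
        return 0.0  # AUC is undefined without both classes; 0.0 is a placeholder
    return area / (tot_pos * tot_neg)


perfect = roc_auc_bucketed([0.9, 0.8, 0.2, 0.1], [1, 1, 0, 0])
inverted = roc_auc_bucketed([0.9, 0.8, 0.2, 0.1], [0, 0, 1, 1])
tied = roc_auc_bucketed([0.5, 0.5], [1, 0])
```

Keeping only per-bucket counts means the state has a fixed size regardless of how many minibatches are accumulated.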
ChunkEvaluator
-
class paddle.fluid.metrics.ChunkEvaluator(name=None)
Accumulate the counter numbers output by chunk_eval from mini-batches and compute the precision, recall and F1-score using the accumulated counter numbers. For some basics of chunking, please refer to Chunking with Support Vector Machines. ChunkEvaluator computes the precision, recall, and F1-score of chunk detection, and supports the IOB, IOE, IOBES and IO (also known as plain) tagging schemes.
Examples
import paddle.fluid as fluid
# init the chunk-level evaluation manager
metric = fluid.metrics.ChunkEvaluator()
# suppose the model predicts 10 chunks, of which 8 are correct, and the ground truth has 9 chunks
num_infer_chunks = 10
num_label_chunks = 9
num_correct_chunks = 8
metric.update(num_infer_chunks, num_label_chunks, num_correct_chunks)
numpy_precision, numpy_recall, numpy_f1 = metric.eval()
print("precision: %.2f, recall: %.2f, f1: %.2f" % (numpy_precision, numpy_recall, numpy_f1))
# the next batch, predicting 3 perfectly correct chunks
num_infer_chunks = 3
num_label_chunks = 3
num_correct_chunks = 3
metric.update(num_infer_chunks, num_label_chunks, num_correct_chunks)
numpy_precision, numpy_recall, numpy_f1 = metric.eval()
print("precision: %.2f, recall: %.2f, f1: %.2f" % (numpy_precision, numpy_recall, numpy_f1))
-
update(num_infer_chunks, num_label_chunks, num_correct_chunks)
Update the states based on the layers.chunk_eval() outputs.
- Parameters
num_infer_chunks (int|numpy.array) – The number of chunks in Inference on the given minibatch.
num_label_chunks (int|numpy.array) – The number of chunks in Label on the given mini-batch.
num_correct_chunks (int|float|numpy.array) – The number of chunks both in Inference and Label on the given mini-batch.
-
eval()
Evaluate the current metrics based on the accumulated states.
- Returns
the metrics via Python.
- Return type
float|list(float)|numpy.array
-
get_config()
Get the metric and its current states. The states are the members that do not have a “_” prefix.
- Parameters
None –
- Returns
a dict of metric and states
- Return type
dict
-
reset()
Clear the states of the metric. By default, the states are the members that do not have a “_” prefix; reset sets them to their initial states. If you do not follow this implicit naming rule, please also customize the reset interface.
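The precision, recall and F1-score follow from the three accumulated counters by their standard definitions. The sketch below works through the two batches of the example above in pure Python; the helper name chunk_prf and the choice to return 0.0 for empty counters are assumptions made for illustration.

```python
def chunk_prf(num_infer_chunks, num_label_chunks, num_correct_chunks):
    """Standard precision / recall / F1 from accumulated chunk counters."""
    precision = num_correct_chunks / num_infer_chunks if num_infer_chunks else 0.0
    recall = num_correct_chunks / num_label_chunks if num_label_chunks else 0.0
    if num_correct_chunks:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0  # avoids 0 / 0 when nothing was predicted correctly
    return precision, recall, f1


# Counters summed over both batches: 10 + 3 inferred, 9 + 3 labeled, 8 + 3 correct.
precision, recall, f1 = chunk_prf(10 + 3, 9 + 3, 8 + 3)
print("precision: %.2f, recall: %.2f, f1: %.2f" % (precision, recall, f1))
```

Because the counters are summed before dividing, the scores are computed over all chunks seen so far rather than averaged per batch.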
-
CompositeMetric
-
class paddle.fluid.metrics.CompositeMetric(name=None)
Combine multiple metrics in one instance. For example, merge F1, accuracy and recall into one Metric.
Examples
import paddle.fluid as fluid
import numpy as np
preds = [[0.1], [0.7], [0.8], [0.9], [0.2], [0.2], [0.3], [0.5], [0.8], [0.6]]
labels = [[0], [1], [1], [1], [1], [0], [0], [0], [0], [0]]
preds = np.array(preds)
labels = np.array(labels)
comp = fluid.metrics.CompositeMetric()
precision = fluid.metrics.Precision()
recall = fluid.metrics.Recall()
comp.add_metric(precision)
comp.add_metric(recall)
comp.update(preds=preds, labels=labels)
numpy_precision, numpy_recall = comp.eval()
print("expect precision: %.2f, got %.2f" % (3. / 5, numpy_precision))
print("expect recall: %.2f, got %.2f" % (3. / 4, numpy_recall))
-
add_metric(metric)
Add one metric instance to CompositeMetric.
- Parameters
metric – an instance of MetricBase.
-
update(preds, labels)
Update every metric in sequence.
- Parameters
preds (numpy.array) – the predictions of the current minibatch.
labels (numpy.array) – the labels of the current minibatch. If the label is one-hot or soft-label, you should customize the corresponding update rule.
-
eval()
Evaluate every metric in sequence.
- Returns
a list of metric values in Python.
- Return type
list(float|numpy.array)
-
get_config()
Get the metric and its current states. The states are the members that do not have a “_” prefix.
- Parameters
None –
- Returns
a dict of metric and states
- Return type
dict
-
reset()
Clear the states of the metric. By default, the states are the members that do not have a “_” prefix; reset sets them to their initial states. If you do not follow this implicit naming rule, please also customize the reset interface.
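The composite pattern described here can be sketched in a few lines: the container forwards update to each member in insertion order and collects the eval results in the same order. All class names below (ToyAccuracy, ToyPositiveRate, ToyComposite) are hypothetical and written only for illustration; predictions are assumed to be thresholded with Python's round.

```python
class ToyAccuracy:
    """Hypothetical metric: fraction of rounded predictions equal to the label."""

    def __init__(self):
        self.correct = 0
        self.total = 0

    def update(self, preds, labels):
        for p, y in zip(preds, labels):
            self.correct += int(round(p) == y)
            self.total += 1

    def eval(self):
        return self.correct / self.total


class ToyPositiveRate:
    """Hypothetical metric: fraction of predictions rounded to class 1."""

    def __init__(self):
        self.positive = 0
        self.total = 0

    def update(self, preds, labels):
        for p in preds:
            self.positive += int(round(p) == 1)
            self.total += 1

    def eval(self):
        return self.positive / self.total


class ToyComposite:
    """Forwards update/eval to every registered metric, in insertion order."""

    def __init__(self):
        self._metrics = []

    def add_metric(self, metric):
        self._metrics.append(metric)

    def update(self, preds, labels):
        for metric in self._metrics:
            metric.update(preds, labels)

    def eval(self):
        return [metric.eval() for metric in self._metrics]


preds = [0.1, 0.7, 0.8, 0.9, 0.2, 0.2, 0.3, 0.5, 0.8, 0.6]
labels = [0, 1, 1, 1, 1, 0, 0, 0, 0, 0]
comp = ToyComposite()
comp.add_metric(ToyAccuracy())
comp.add_metric(ToyPositiveRate())
comp.update(preds, labels)
results = comp.eval()  # one value per metric, in the order they were added
```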
-
DetectionMAP
-
class paddle.fluid.metrics.DetectionMAP(input, gt_label, gt_box, gt_difficult=None, class_num=None, background_label=0, overlap_threshold=0.5, evaluate_difficult=True, ap_version='integral')
Calculate the detection mean average precision (mAP).
The general steps are as follows:
1. Calculate the true positives and false positives according to the detection input and the labels.
2. Calculate the mAP value. Two versions are supported: ‘11point’ and ‘integral’.
Please get more information from the following articles:
- Parameters
input (Variable) – The detection results, which is a LoDTensor with shape [M, 6]. The layout is [label, confidence, xmin, ymin, xmax, ymax].
gt_label (Variable) – The ground truth label index, which is a LoDTensor with shape [N, 1].
gt_box (Variable) – The ground truth bounding box (bbox), which is a LoDTensor with shape [N, 4]. The layout is [xmin, ymin, xmax, ymax].
gt_difficult (Variable|None) – Whether this ground truth is a difficult bounding box, which can be a LoDTensor [N, 1] or not set. If None, none of the ground truth boxes are treated as difficult.
class_num (int) – The number of classes.
background_label (int) – The index of the background label, which is ignored. If set to -1, all categories are considered. 0 by default.
overlap_threshold (float) – The threshold for deciding true/false positive, 0.5 by default.
evaluate_difficult (bool) – Whether to consider difficult ground truth for evaluation, True by default. This argument has no effect when gt_difficult is None.
ap_version (string) – The method for calculating average precision; it must be ‘integral’ or ‘11point’. Please check https://sanchom.wordpress.com/tag/average-precision/ for details.
- 11point: the 11-point interpolated average precision.
- integral: the natural integral of the precision-recall curve.
Examples
import paddle.fluid as fluid
import paddle.fluid.layers as layers
batch_size = -1  # can be any size
image_boxs_num = 10
bounding_bboxes_num = 21
pb = layers.data(name='prior_box', shape=[image_boxs_num, 4],
                 append_batch_size=False, dtype='float32')
pbv = layers.data(name='prior_box_var', shape=[image_boxs_num, 4],
                  append_batch_size=False, dtype='float32')
loc = layers.data(name='target_box', shape=[batch_size, bounding_bboxes_num, 4],
                  append_batch_size=False, dtype='float32')
scores = layers.data(name='scores', shape=[batch_size, bounding_bboxes_num, image_boxs_num],
                     append_batch_size=False, dtype='float32')
nmsed_outs = fluid.layers.detection_output(scores=scores, loc=loc, prior_box=pb, prior_box_var=pbv)
gt_box = fluid.layers.data(name="gt_box", shape=[batch_size, 4], dtype="float32")
gt_label = fluid.layers.data(name="gt_label", shape=[batch_size, 1], dtype="float32")
difficult = fluid.layers.data(name="difficult", shape=[batch_size, 1], dtype="float32")
exe = fluid.Executor(fluid.CUDAPlace(0))
map_evaluator = fluid.metrics.DetectionMAP(nmsed_outs, gt_label, gt_box, difficult, class_num=3)
cur_map, accum_map = map_evaluator.get_map_var()
# see detailed examples at https://github.com/PaddlePaddle/models/blob/43cdafbb97e52e6d93cc5bbdc6e7486f27665fc8/PaddleCV/object_detection
-
get_map_var()
- Returns
the mAP variable of the current mini-batch and the accumulative mAP variable across mini-batches.
-
reset(executor, reset_program=None)
Reset the metric states at the beginning of each pass or user-specified batch.
- Parameters
executor (Executor) – an executor for executing the reset_program.
reset_program (Program|None) – a single Program for the reset process. If None, a Program will be created.
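The two ap_version options correspond to two common ways of reducing a precision-recall curve to one number. The sketch below follows their usual textbook definitions for a single class in pure Python; it may differ in detail from the operator that DetectionMAP uses, and the function name average_precision and the sample curve are hypothetical.

```python
def average_precision(recalls, precisions, ap_version="integral"):
    """Average precision from PR points ordered by non-decreasing recall."""
    if ap_version == "11point":
        total = 0.0
        for i in range(11):
            threshold = i / 10.0
            # Interpolated precision: best precision at recall >= threshold.
            candidates = [p for r, p in zip(recalls, precisions) if r >= threshold]
            total += max(candidates) if candidates else 0.0
        return total / 11.0
    if ap_version == "integral":
        ap, prev_recall = 0.0, 0.0
        for r, p in zip(recalls, precisions):
            # Rectangle of height p over the recall gained at this point.
            ap += p * abs(r - prev_recall)
            prev_recall = r
        return ap
    raise ValueError("ap_version must be 'integral' or '11point'")


# Three ranked detections for two ground-truth boxes: hit, miss, hit.
recalls = [0.5, 0.5, 1.0]
precisions = [1.0, 0.5, 2.0 / 3.0]
ap_integral = average_precision(recalls, precisions, "integral")
ap_11point = average_precision(recalls, precisions, "11point")
```

The mAP is then the mean of the per-class AP values; the two versions generally give slightly different numbers for the same detections.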
EditDistance
-
class paddle.fluid.metrics.EditDistance(name)
Edit distance is a way of quantifying how dissimilar two strings (e.g., words) are to one another by counting the minimum number of edit operations (add, remove or replace) required to transform one string into the other. Refer to https://en.wikipedia.org/wiki/Edit_distance
This EditDistance class takes two inputs through the update function:
1. distances: a (batch_size, 1) numpy.array, each element of which represents the edit distance between two sequences.
2. seq_num: an int|float value, standing for the number of sequence pairs.
It returns the overall edit distance of multiple sequence pairs.
- Parameters
name – the name of the metric
Examples
import paddle.fluid as fluid
import numpy as np
# suppose that batch_size is 128
batch_size = 128
# init the edit distance manager
distance_evaluator = fluid.metrics.EditDistance("EditDistance")
# generate the edit distance across 128 sequence pairs, the max distance is 10 here
edit_distances_batch0 = np.random.randint(low=0, high=10, size=(batch_size, 1))
seq_num_batch0 = batch_size
distance_evaluator.update(edit_distances_batch0, seq_num_batch0)
avg_distance, wrong_instance_ratio = distance_evaluator.eval()
print("the average edit distance for batch0 is %.2f and the wrong instance ratio is %.2f " % (avg_distance, wrong_instance_ratio))
edit_distances_batch1 = np.random.randint(low=0, high=10, size=(batch_size, 1))
seq_num_batch1 = batch_size
distance_evaluator.update(edit_distances_batch1, seq_num_batch1)
avg_distance, wrong_instance_ratio = distance_evaluator.eval()
print("the average edit distance for batch0 and batch1 is %.2f and the wrong instance ratio is %.2f " % (avg_distance, wrong_instance_ratio))
distance_evaluator.reset()
edit_distances_batch2 = np.random.randint(low=0, high=10, size=(batch_size, 1))
seq_num_batch2 = batch_size
distance_evaluator.update(edit_distances_batch2, seq_num_batch2)
avg_distance, wrong_instance_ratio = distance_evaluator.eval()
print("the average edit distance for batch2 is %.2f and the wrong instance ratio is %.2f " % (avg_distance, wrong_instance_ratio))
-
update(distances, seq_num)
Update the overall edit distance.
- Parameters
distances – a (batch_size, 1) numpy.array, each element of which represents the edit distance between two sequences.
seq_num – an int|float value, standing for the number of sequence pairs.
-
eval()
Return two floats:
avg_distance: the average distance over all sequence pairs passed to the update function.
avg_instance_error: the ratio of sequence pairs whose edit distance is not zero.
-
get_config()
Get the metric and its current states. The states are the members that do not have a “_” prefix.
- Parameters
None –
- Returns
a dict of metric and states
- Return type
dict
-
reset()
Clear the states of the metric. By default, the states are the members that do not have a “_” prefix; reset sets them to their initial states. If you do not follow this implicit naming rule, please also customize the reset interface.
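The two values returned by eval can be reproduced from three running totals. The following pure-Python sketch shows that bookkeeping under the assumption that distances are given as a flat list rather than a (batch_size, 1) array; the class name EditDistanceStats is hypothetical.

```python
class EditDistanceStats:
    """Hypothetical accumulator for average edit distance and error ratio."""

    def __init__(self):
        self.total_distance = 0.0  # sum of all edit distances seen
        self.seq_num = 0           # number of sequence pairs seen
        self.instance_error = 0    # pairs with a non-zero distance

    def update(self, distances, seq_num):
        self.total_distance += sum(distances)
        self.instance_error += sum(1 for d in distances if d != 0)
        self.seq_num += seq_num

    def eval(self):
        if self.seq_num == 0:
            raise ValueError("no sequence pairs have been accumulated")
        avg_distance = self.total_distance / self.seq_num
        avg_instance_error = self.instance_error / self.seq_num
        return avg_distance, avg_instance_error


stats = EditDistanceStats()
stats.update([0, 2, 1, 0], seq_num=4)  # first batch: two exact matches
stats.update([3, 0], seq_num=2)        # second batch: one exact match
avg_distance, avg_instance_error = stats.eval()
```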
MetricBase
-
class paddle.fluid.metrics.MetricBase(name)
Base class for all metrics. MetricBase defines a group of interfaces for model evaluation methods. Metrics accumulate metric states across consecutive minibatches: at every minibatch, use the update interface to add the current minibatch value to the global states. Use eval to compute the accumulated metric value since the last reset() or from scratch. If you need a custom metric, please inherit from MetricBase and provide your own implementation.
- Parameters
name (str) – The name of the metric instance, such as “accuracy”. It is needed if you want to distinguish different metrics in a model.
-
reset()
Clear the states of the metric. By default, the states are the members that do not have a “_” prefix; reset sets them to their initial states. If you do not follow this implicit naming rule, please also customize the reset interface.
-
get_config()
Get the metric and its current states. The states are the members that do not have a “_” prefix.
- Parameters
None –
- Returns
a dict of metric and states
- Return type
dict
-
update(preds, labels)
Update the metric states at every minibatch. The minibatch metric can be computed in pure Python or via a C++ operator.
- Parameters
preds (numpy.array) – the predictions of the current minibatch.
labels (numpy.array) – the labels of the current minibatch. If the label is one-hot or soft-label, you should customize the corresponding update rule.
-
eval()
Evaluate the current metrics based on the accumulated states.
- Returns
the metrics via Python.
- Return type
float|list(float)|numpy.array
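The naming rule for states can be illustrated with a small stand-in base class: attributes without a leading underscore are treated as states, so reset and get_config can find them by introspection. This is a sketch of the convention as described here, not the library source; SketchMetricBase and MeanError are hypothetical names, and resetting non-numeric states to None is an assumption.

```python
class SketchMetricBase:
    """Hypothetical base class that mirrors the described naming rule."""

    def __init__(self, name):
        self._name = name  # leading underscore: configuration, not a state

    def reset(self):
        # Return every public attribute to a zero-like initial value.
        for attr, value in list(vars(self).items()):
            if attr.startswith("_"):
                continue
            if isinstance(value, int):
                setattr(self, attr, 0)
            elif isinstance(value, float):
                setattr(self, attr, 0.0)
            else:
                setattr(self, attr, None)

    def get_config(self):
        states = {k: v for k, v in vars(self).items() if not k.startswith("_")}
        return {"name": self._name, "states": states}


class MeanError(SketchMetricBase):
    """Custom metric: mean absolute error accumulated over minibatches."""

    def __init__(self, name="mean_error"):
        super().__init__(name)
        self.total = 0.0  # public: picked up as a state
        self.count = 0    # public: picked up as a state

    def update(self, preds, labels):
        self.total += sum(abs(p - y) for p, y in zip(preds, labels))
        self.count += len(labels)

    def eval(self):
        return self.total / self.count


metric = MeanError()
metric.update([1.0, 2.0], [1.0, 4.0])
value = metric.eval()
metric.reset()
config_after_reset = metric.get_config()
```

A subclass that keeps a state under a name starting with an underscore would be skipped by this reset, which is why such a class needs its own reset.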
Precision
-
class paddle.fluid.metrics.Precision(name=None)
Precision (also called positive predictive value) is the fraction of relevant instances among the retrieved instances. See https://en.wikipedia.org/wiki/Evaluation_of_binary_classifiers
This class manages the precision score for binary classification tasks.
Examples
import paddle.fluid as fluid
import numpy as np
metric = fluid.metrics.Precision()
# generate the preds and labels
preds = [[0.1], [0.7], [0.8], [0.9], [0.2], [0.2], [0.3], [0.5], [0.8], [0.6]]
labels = [[0], [1], [1], [1], [1], [0], [0], [0], [0], [0]]
preds = np.array(preds)
labels = np.array(labels)
metric.update(preds=preds, labels=labels)
numpy_precision = metric.eval()
print("expect precision: %.2f and got %.2f" % (3.0 / 5.0, numpy_precision))
-
update(preds, labels)
Update the metric states at every minibatch. The minibatch metric can be computed in pure Python or via a C++ operator.
- Parameters
preds (numpy.array) – the predictions of the current minibatch.
labels (numpy.array) – the labels of the current minibatch. If the label is one-hot or soft-label, you should customize the corresponding update rule.
-
eval()
Evaluate the current metrics based on the accumulated states.
- Returns
the metrics via Python.
- Return type
float|list(float)|numpy.array
-
get_config()
Get the metric and its current states. The states are the members that do not have a “_” prefix.
- Parameters
None –
- Returns
a dict of metric and states
- Return type
dict
-
reset()
Clear the states of the metric. By default, the states are the members that do not have a “_” prefix; reset sets them to their initial states. If you do not follow this implicit naming rule, please also customize the reset interface.
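The expected value of 3/5 in the example above can be checked by hand by counting true positives and false positives. The sketch below does so in pure Python across two minibatches, assuming that a prediction counts as positive when it rounds to 1 (with 0.5 rounding down, which is what the expected value implies); the helper name count_tp_fp is hypothetical.

```python
def count_tp_fp(preds, labels):
    """Count true and false positives among predictions that round to 1."""
    tp = fp = 0
    for p, y in zip(preds, labels):
        if round(p) == 1:  # Python rounds 0.5 to 0 (round half to even)
            if y == 1:
                tp += 1
            else:
                fp += 1
    return tp, fp


# The ten samples of the example above, split into two minibatches.
batches = [
    ([0.1, 0.7, 0.8, 0.9, 0.2], [0, 1, 1, 1, 1]),
    ([0.2, 0.3, 0.5, 0.8, 0.6], [0, 0, 0, 0, 0]),
]
total_tp = total_fp = 0
for batch_preds, batch_labels in batches:
    tp, fp = count_tp_fp(batch_preds, batch_labels)
    total_tp += tp
    total_fp += fp

precision = total_tp / (total_tp + total_fp)
```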
-
Recall
-
class paddle.fluid.metrics.Recall(name=None)
Recall (also known as sensitivity) is the fraction of relevant instances that have been retrieved over the total amount of relevant instances. See https://en.wikipedia.org/wiki/Precision_and_recall
This class manages the recall score for binary classification tasks.
Examples
import paddle.fluid as fluid
import numpy as np
metric = fluid.metrics.Recall()
# generate the preds and labels
preds = [[0.1], [0.7], [0.8], [0.9], [0.2], [0.2], [0.3], [0.5], [0.8], [0.6]]
labels = [[0], [1], [1], [1], [1], [0], [0], [0], [0], [0]]
preds = np.array(preds)
labels = np.array(labels)
metric.update(preds=preds, labels=labels)
numpy_recall = metric.eval()
print("expect recall: %.2f and got %.2f" % (3.0 / 4.0, numpy_recall))
-
update(preds, labels)
Update the metric states at every minibatch. The minibatch metric can be computed in pure Python or via a C++ operator.
- Parameters
preds (numpy.array) – the predictions of the current minibatch.
labels (numpy.array) – the labels of the current minibatch. If the label is one-hot or soft-label, you should customize the corresponding update rule.
-
eval()
Evaluate the current metrics based on the accumulated states.
- Returns
the metrics via Python.
- Return type
float|list(float)|numpy.array
-
get_config()
Get the metric and its current states. The states are the members that do not have a “_” prefix.
- Parameters
None –
- Returns
a dict of metric and states
- Return type
dict
-
reset()
Clear the states of the metric. By default, the states are the members that do not have a “_” prefix; reset sets them to their initial states. If you do not follow this implicit naming rule, please also customize the reset interface.
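Recall differs from precision only in its denominator: it divides the true positives by all samples whose label is 1, whether or not they were predicted as positive. A pure-Python check of the 3/4 value from the example above, under the assumption that a prediction is positive when it rounds to 1; the helper name count_tp_fn is hypothetical.

```python
def count_tp_fn(preds, labels):
    """Count true positives and false negatives among samples labeled 1."""
    tp = fn = 0
    for p, y in zip(preds, labels):
        if y == 1:
            if round(p) == 1:
                tp += 1
            else:
                fn += 1  # a positive sample the model missed
    return tp, fn


preds = [0.1, 0.7, 0.8, 0.9, 0.2, 0.2, 0.3, 0.5, 0.8, 0.6]
labels = [0, 1, 1, 1, 1, 0, 0, 0, 0, 0]
tp, fn = count_tp_fn(preds, labels)
recall = tp / (tp + fn)
```

The single false negative is the fifth sample, whose label is 1 but whose prediction of 0.2 rounds to 0.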
-