pyml.neighbors.knn.kNNClassifier

class kNNClassifier(k=3, metric='euclidean')

Bases: object

Classifier model using the nearest neighbor algorithm

K-nearest neighbors (KNN) is a simple and intuitive machine learning algorithm that can be used for classification and regression tasks. For classification, the model predicts the class of a data point as the majority class among its K nearest data points in the feature space; for regression, the prediction would instead be the average of the neighbors' target values.
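The majority vote described above can be sketched in plain Python. This is an illustration of the idea only, not the pyml implementation: the helper name `knn_predict_one`, its arguments, and the toy data are hypothetical, and Euclidean distance is assumed.

```python
import math
from collections import Counter


def knn_predict_one(train_points, train_labels, query, k=3):
    """Majority vote among the k training points closest to `query` (hypothetical helper)."""
    # Pair every training label with its Euclidean distance to the query point
    scored = [(math.dist(p, query), label) for p, label in zip(train_points, train_labels)]
    # Keep the k closest neighbors
    nearest = sorted(scored, key=lambda pair: pair[0])[:k]
    # Return the most common label among them
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Toy data: two well-separated clusters labelled "a" and "b"
train_points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (5.0, 5.0), (5.0, 6.0), (6.0, 5.0)]
train_labels = ["a", "a", "a", "b", "b", "b"]

label_near_origin = knn_predict_one(train_points, train_labels, (0.2, 0.1))
label_far_corner = knn_predict_one(train_points, train_labels, (5.5, 5.2))
```

With k=3, each query point here is surrounded by three neighbors from a single cluster, so the vote is unanimous. An odd k is a common choice for two-class problems because it avoids tied votes.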

The following metrics are supported:

- euclidean
- manhatten
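The two metrics differ only in how the per-coordinate differences are aggregated. The sketch below shows both for a pair of points; the function names are illustrative and are not taken from pyml, which selects the metric through the string passed to `metric`.

```python
import math


def euclidean(a, b):
    """Straight-line distance: square root of the summed squared differences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def manhattan(a, b):
    """City-block distance: sum of the absolute coordinate differences."""
    return sum(abs(x - y) for x, y in zip(a, b))


p, q = (0.0, 0.0), (3.0, 4.0)
d_euclidean = euclidean(p, q)  # 5.0
d_manhattan = manhattan(p, q)  # 7.0
```

The city-block distance is never smaller than the Euclidean one, so the choice of metric can change which points count as the nearest neighbors.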

Parameters:

k (int, optional) – Specifies the number of nearest neighbors to consider when predicting on new data. By default 3.
metric (str, optional) – Specifies the metric used for calculating the distance. By default ‘euclidean’.

Variables:

metrics (List[str]) – Defines the metrics that are currently supported

Raises:

UnknownMetric – Raised when using an unknown metric name (including spelling errors)
ShapeError – Raised when computing the distance for incompatible matrices

Methods

`__init__`
`fit`	Fit model on training data
`predict`	Calculates predictions for given data points

Attributes

metrics

_compute_distance(x1, x2)

Computes the distance between two matrix-like objects using the defined metric

One of the parameters must be a matrix with only one row or, alternatively, a vector.

Parameters:

x1 (numpy.ndarray) – Input matrix
x2 (numpy.ndarray) – Input matrix

Returns:

Matrix consisting of the distances

Return type:

numpy.ndarray

Raises:

ShapeError – Raised if the shapes of the inputs do not match
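The single-row requirement maps naturally onto NumPy broadcasting. The following is a rough sketch of that idea, not the library's code: `row_distances` is a hypothetical helper, only the Euclidean metric is shown, and the built-in `ValueError` stands in for `ShapeError`.

```python
import numpy as np


def row_distances(x1, x2):
    """Euclidean distance between a matrix and a single row (hypothetical helper)."""
    # Promote plain vectors to one-row matrices so both inputs are 2-D
    a = np.atleast_2d(np.asarray(x1, dtype=float))
    b = np.atleast_2d(np.asarray(x2, dtype=float))
    # The feature counts must agree, and at least one input must be a single row
    if a.shape[1] != b.shape[1] or (a.shape[0] != 1 and b.shape[0] != 1):
        raise ValueError(f"incompatible shapes {a.shape} and {b.shape}")
    # Broadcasting stretches the single row across every row of the other input
    diff = a - b
    return np.sqrt((diff ** 2).sum(axis=1))


X = np.array([[0.0, 0.0], [3.0, 4.0], [6.0, 8.0]])
distances = row_distances(X, np.array([0.0, 0.0]))
```

The result holds one distance per row of the larger input. Passing a vector with a different number of features than the matrix triggers the shape check instead of a silent broadcast.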