What is Lasso (Least Absolute Shrinkage And Selection Operator) ?

Lasso (Least Absolute Shrinkage And Selection Operator) in statistics and machine learning is a regression analysis method that performs both variable selection and regularization in order to enhance the prediction accuracy and interpretability of the statistical model it produces. Lasso was originally formulated for least squares models and this simple case reveals a substantial amount about the behaviour of the … Read more

What is Kolmogorov-Smirnov test?

Kolmogorov-Smirnov test (K–S test or KS test) is a nonparametric test of the equality of continuous, one-dimensional probability distributions that can be used to compare a sample with a reference probability distribution (one-sample K–S test), or to compare two samples (two-sample K–S test). The Kolmogorov–Smirnov statistic quantifies a distance between the empirical distribution function of the sample and the cumulative … Read more

What is K-Nearest Neighbour (KNN)?

K-Nearest Neighbour (KNN) in pattern recognition is a non-parametric method used for classification and regression. In both cases, the input consists of the k closest training examples in the feature space. The output depends on whether k-NN is used for classification or regression. In k-NN classification, the output is a class membership. An object is classified by a majority vote … Read more

What is K-means Clustering?

K-means Clustering is a method of vector quantization, originally from signal processing, that is popular for cluster analysis in data mining. K-means clustering aims to partition n observations into k clusters in which each observation belongs to the cluster with the nearest mean, serving as a prototype of the cluster. This results in a partitioning of the data space into … Read more

What is K-means Algorithm in machine learning?

K-means Algorithm is one of the simplest unsupervised learning algorithms that solve the well-known clustering problem. The procedure follows a simple and easy way to classify a given data set through a certain number of clusters (assume k clusters) fixed a priori. The main idea is to define k centroids, one for each cluster. These centroids should be placed in … Read more

What is Kernel Trick ?

Kernel Trick is an approach consisting in the use of kernel functions, operating in a high-dimensional, implicit feature space without ever computing the coordinates of the data in that space, but rather by simply computing the inner products between the images of all pairs of data in the feature space. This operation is often computationally cheaper than the explicit computation … Read more

What is Jackknife Resampling (Jacknifing)?

Jackknife Resampling (Jacknifing) in statistics is a resampling technique especially useful for variance and bias estimation. The jackknife predates other common resampling methods such as the bootstrap. The jackknife estimator of a parameter is found by systematically leaving out each observation from a dataset and calculating the estimate and then finding the average of these calculations. Given a sample of … Read more

What is Interpolation?

Interpolation in the mathematical field of numerical analysis, is a method of constructing new data points within the range of a discrete set of known data points. In engineering and science, one often has a number of data points, obtained by sampling or experimentation, which represent the values of a function for a limited number of values of the independent … Read more

What is Intercept?

Intercept is the expected mean value of Y when all X=0. If we start with a regression equation with one predictor, X. If X sometimes is equal zero, the intercept is simply the expected mean value of Y at that value. If X never equals zero, then the intercept has no intrinsic meaning. If so, and if X never = … Read more

What is Information Retrieval (IR) ?

Information Retrieval (IR) is the action/process of obtaining information resources relevant to an information need from a collection of information resources. Searches can be based on full-text or other content-based indexing. Information retrieval is the science of searching for information in a document, searching for documents themselves, searching for metadata that describes data and for databases such as text, image … Read more