Reading Texts on Image by Using Tesseract and PyOCR in Python

    Optical Character Recognition (OCR) is a conversion of typed or handwritten letters on an image into the machine encoded texts.  There are several methods and libraries that can be used to read text on image.

    In this tutorial, we'll briefly learn how to read letters in an image by using the Tesseract and PyOCR in Python. The tutorial covers:

  1. Installing Tesseract and PyOCR
  2. Reading texts on image
  3. Source code listing
   Let's get started.

SelectKBest Feature Selection Example in Python

     Scikit-learn API provides SelectKBest class for extracting best features of given dataset. The SelectKBest method selects the features according to the k highest score. By changing the 'score_func' parameter we can apply the method for both classification and regression data. Selecting best features is important process when we prepare a large dataset for training. It helps us to eliminate less important part of the data and reduce a training time.

    In this tutorial, we'll briefly learn how to select best features of classification and regression data by using the SelectKBest in Python. The tutorial covers:

  1. SelectKBest for classification data
  2. SelectKBest for regression data
  3. Source code listing
   We'll start by loading the required libraries and functions.

Dimensionality Reduction Example with Factor Analysis in Python

     Factor Analysis is a technique that used to express data with reduced number of variables. Reducing the number of variables in a data is helpful method to simplify large dataset by decreasing the variables without loosing the generality of it. 

    The Scikit-learn API provides the FactorAnalysis model that performs a maximum likelihood estimate of  loading matrix using SVD based approach. In this tutorial, we'll briefly learn how to use FactorAnalysis model to reduce the data dimension and visualize the output in Python. The tutorials covers:

  1. MNIST dataset Projection with Factor Analysis
  2. Image data Factor Analysis and visualizing
  3. Source code listing

SparsePCA Projection Example in Python

     Sparse Principal Component Analysis is a type of PCA analysis method. SparsePCA extracts sparse components to build the data.

    The Scikit-learn API provides SparsePCA class to apply Sparse PCA method in Python. In this tutorial, we'll briefly learn how to project data by using SparsePCA and visualize the projected data in Python. The tutorials covers:

  1. Iris dataset SparsePCA projection and visualizing
  2. MNIST dataset SparsePCA projection and visualizing
  3. Source code listing

Anomaly Detection Example With OPTICS Method in Python

    Ordering Points To Identify the Clustering Structure (OPTICS) is an algorithm that estimates density-based clustering structure of a given data. It applies the clustering method similar to DBSCAN algorithm.

    In this tutorial, we'll learn how to apply OPTICS method to detect anomalies in given data. Here, we use OPTIC class of Scikit-learn API. The tutorial covers:

  1. Preparing the data
  2. Anomaly detection with OPTICS
  3. Source code listing

Spectral Clustering Example in Python

    Spectral clustering is a technique to apply the spectrum of the similarity matrix of the data in dimensionality reduction. It is useful and easy to implement clustering method.  

    The Scikit-learn API provides SpectralClustering class to implement spectral clustering method in Python. The SpectralClustering applies the clustering to a projection of the normalized Laplacian. In this tutorial, we'll briefly learn how to cluster and visualize data with SpectralClustering in Python. The tutorial covers:

  1. Preparing the data
  2. Clustering with the SpectralClustering and visualizing
  3. Source code listing