Browsed by
Category: data science

Exploring a Python Pandas dataframe

Exploring a Python Pandas dataframe

Panda is open source, the BSD-licensed library used for data analysis in Python programming language. It is build on top of two most essential python packages numpy and matplotlib. Numpy provides multidimensional array objects for easy data manipulation which pandas uses to store data. Matplotlib has powerful data visualization capabilities. It is most popularly used for data manipulation and data visualization. Pandas designed to work with tabular form of data which is also called dataframe….

Read full Article Read More

How to plot a histogram using matplotlib in python?

How to plot a histogram using matplotlib in python?

According to matplotlib official, Matplotlib is a comprehensive library for creating static, animated, and interactive visualizations in Python. Matplotlib has pyplot as a module that provides a MATLAB-like interface. It is designed to be as applicable as MATLAB, with the ability to use python, and being opensource. There are various plotting techniques in matplotlib pyplot like line plot, histogram, scatter plot, 3D plot, Image plot, Contour plot, Scatter plot, Polar plot, Line plot, 3-D plot and…

Read full Article Read More

What is Exploratory Data Analysis (EDA)?

What is Exploratory Data Analysis (EDA)?

Exploratory Data Analysis ( EDA ) is the process of organizing, plotting and summarizing a data set. EDA was developed by one the greatest statistician of all time John W. Tukey. In his book “Exploratory data analysis” in 1977 where the principal for EDA was laid. He said “Exploratory Data Analysis can never be the whole story, but nothing else can serve as the foundation stone.” “Exploratory Data Analysis can never be the whole story,…

Read full Article Read More

What is Machine Learning?

What is Machine Learning?

Machine learning is being used by people in various ways knowingly or unknowingly. Each time when we Google or Bing search we get the best results because they have used machine learning to rank page. The world biggest search engine giant Google offers recommendation and suggestion based on the previous user searches. In 2012, Google introduced Knowledge Graph -an algorithm used to decipher the semantic content of a search query. Facebook uses a face recognition…

Read full Article Read More

What is Histograms of Oriented Gradients?

What is Histograms of Oriented Gradients?

HoGs are a feature descriptor that has been widely and successfully used for object detection. It presents objects as a single feature vector as oppossed to a set of feature vectors where each represents a segment of the image. It’s computed by sliding window detector over an image, where a HOG descriptor is a computed for each position. Like SIFT the scale of the image is adjusted (pyramiding). HOGs are often used with SVM (Support…

Read full Article Read More

What is Support Vector Machine (SVM)?

What is Support Vector Machine (SVM)?

Support Vector Machine is a supervised learning algorithm which can be used for both classification or regression problems. In this algorithm, data points are plotted in an n-dimension space (where n is a number of features) with the value of each feature being the value of a particular coordinate. Then, classification is done by finding hyper-plane separating between two classes. Support Vector Machine is a frontier which best segregates the two classes (hyper-plane/ line). Support…

Read full Article Read More

What is Mask RCNN?

What is Mask RCNN?

Mask RCNN is Faster RCNN (object detection with bounding boxes) with a mask on it. It is developed by Facebook AI Research (FAIR).  The output from Mask RCNN is similar to YOLO (You Only Look Once) but the implementation strategy is different. What does mask do in Mask RCNN? Mask features labels each pixel and compares each pixel with an object. Here is the link of the paper written the Facebook AI Research (FAIR) on…

Read full Article Read More

What is YOLO v2 (aka YOLO 9000)?

What is YOLO v2 (aka YOLO 9000)?

“YOLO” or “You Only Look Once”, is a massive Convolutional Neural network for object detection and classification. YOLO9000 is high speed, real-time object detection algorithm that can detect on over 9000 object categories. YOLO was originally written in a framework called darknet. Darknet is an open-source deep learning framework written in C and CUDA. It is fast, easy to install, and supports both CPU and GPU computation. If you want to know more follow this…

Read full Article Read More

Importing and Exporting data in python with panda

Importing and Exporting data in python with panda

Panda is open source, the BSD-licensed library used for data analysis in Python programming language. It is build on top of two most essential python packages numpy and matplotlib. If you have not installed panda please go through this link and play with the code. Data acquisition is the process of loading and reading data from various sources. To read any data from python panda package, we need to consider two important factor i.e. format…

Read full Article Read More

What are Python Packages for Data Science?

What are Python Packages for Data Science?

A python library is a collection of functions and methods that allow you to perform lots of actions without writing any code. The libraries usually consist of built-in models providing different functionalities, which you can use directly. There are a lot of libraries offering a broad range of facilities. Below are some important python libraries used for data science. It is divided into three group i.e. Scientific Computing libraries, Visualization libraries, and Algorithmic libraries. Scientific…

Read full Article Read More