Categories
Data Mining

Clustering in Data Mining

Clustering is a data mining process where data are viewed as points in a multidimensional space. Points that are “close” in this space are assigned to the same cluster.

Categories
Data Mining

Frequent Itemset Challenge in Data Mining

In Business Intelligence (and in data mining in general) a regular need is to be able to find the items that frequently go together in a consumer basket.

Categories
Data Mining

Research Data Analysis of Web Traffic

I want to share on the research which was done by some Estonian students concerning web traffic analysis. The case study they undertook is about mining frequent user access patterns from web log files. The primary objective was to discover the most frequent browsing patterns by analyzing the browsing sessions in logs.

Categories
Data Mining

Using Data Mining in Web Traffic Analysis

This short essay is about data mining methods applied in web traffic analysis and other business intelligence. It also provides a  modern look at data mining in light of the Big Data era.

Categories
Data Mining

The stat visualization that makes sense

As I was searching for data mining and data visualization tools I came across the data visualization website Gapminder by Hans Rosling, the professor of Global Health in Karolinska Institute, Sweden. The website presents over a century of statistic data in visual form in graphs, the data being UN and other world organizations out-sourced.

The professor has done an extensive work with plenty of data sources for this data visualizer, and his efforts are notable.

Categories
Data Mining

Data Journalism Handbook Poster

The poster is composed by Liliana Bounegru and Lulu Pinney shortly says what is in the Data Journalism Handbook. This referrence book shows how journalists can produce  interesting news out of data gathered from the web.

Categories
Data Mining

Data Mining with Google Refine

Google Refine is a free tool for data processing, it standing in line with some other free Google data analysis tools. Because of its close association with web scraping, we want to shed some light on it.