webscraping.pro – Page 41

Anti Web Scraping WordPress Plugins Review

Post author By admin
Post date February 19, 2013

So far we have considered the positive uses of web scraping, but it also has a negative side: scraping to steal other bloggers’ proprietary content. Let’s consider some anti web scraping WordPress plugins.

As for web content ownership, the main indicator is indexing, done mainly by Google. This means that if content is scraped and immediately reposted, Google might be fooled into indexing the copy as the original, while the genuine source is counted as a content farm. Higher-ranking sites may have a better chance of being indexed earlier than the site with the original content, and the latter might even be marked as spam. This is not necessarily a trend, but there have been such cases in the past. It may seem ridiculous, but offenders can use a published feed to detect new content and quickly scrape it for reposting.

Tags anti-scrape, plugin

Web Scraping Software

Web Scraper Shortcode WordPress Plugin Review

Post author By admin
Post date February 19, 2013

This short post covers the WordPress plugin Web Scraper Shortcode, which lets you retrieve a portion of a web page, or a whole page, and insert it directly into a post. The plugin can be used to pull fresh data or images from other web pages into your WordPress-driven page without even visiting them. You can find more scraping plugins and software here.

Data Science

Implementing frequent itemsets algorithm thru MapReduce

Post author By admin
Post date February 18, 2013

The problem of finding frequent itemsets in data analysis is described in this post; here I lay out the practical steps for finding frequent itemsets through MapReduce.

Tags data mining

Data Science

Data Mining: The AdWords Problem Review

Post author By admin
Post date February 15, 2013

This post is a continuation of the previous post on Advertising on the Web and Data mining. Here we conclude by reviewing some basic algorithms for placing ads on the web.

Tags data mining, Google

Data Science

Advertising on the Web and Data mining

Post author By admin
Post date February 14, 2013

The challenge of effective web advertising is primarily one of placing relevant ads on the web pages users request. Those ads must be relevant to the person receiving the page, that is, relevant to the page context and/or directly to the user. What algorithms are used for this? What are the current trends in business intelligence and data mining for digital advertising solutions?

Tags data mining

Uncategorized

Clustering in a Parallel Environment and MapReduce

Post author By admin
Post date February 13, 2013

Having touched on some basics of clusters in data mining, we now want to consider the computation techniques applied to clusters. These techniques are in line with data mining for web traffic analysis.

Tags data mining, Google, MapReduce

Development

How to scrape CSV data files

Post author By admin
Post date February 11, 2013

This short post is a guide on how to scrape CSV data files.
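As a rough illustration of the idea, here is a minimal sketch in Python using only the standard library. It is not the code from the full post: the helper names `parse_csv_text` and `fetch_csv`, the sample data, and the assumption that the first row is a header are all hypothetical choices made for this example.

```python
import csv
import io
from urllib.request import urlopen


def parse_csv_text(text):
    """Parse CSV text whose first row is a header into a list of dicts."""
    reader = csv.DictReader(io.StringIO(text))
    return [dict(row) for row in reader]


def fetch_csv(url, encoding="utf-8"):
    """Download a CSV file from a caller-supplied URL and parse it.

    The encoding is an assumption; real files may need a different one.
    """
    with urlopen(url) as response:
        return parse_csv_text(response.read().decode(encoding))


# Illustrative in-memory data, so the example runs without network access.
sample = "name,price\nwidget,9.99\ngadget,24.50\n"
rows = parse_csv_text(sample)
print(rows[0]["name"], rows[0]["price"])  # widget 9.99
```

All values come back as strings, so numeric columns need an explicit conversion such as `float(row["price"])`.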

Tags Python

Data Science

Clustering in Data Mining

Post author By admin
Post date February 10, 2013

Clustering is a data mining process where data are viewed as points in a multidimensional space. Points that are “close” in this space are assigned to the same cluster.
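To make "close points share a cluster" concrete, here is a minimal sketch in Python. It assumes Euclidean distance and a set of cluster centers that are already known; both are assumptions for illustration, since the excerpt does not name a distance measure or a particular algorithm. The function name `assign_to_nearest` is hypothetical.

```python
import math


def assign_to_nearest(points, centers):
    """Return, for each point, the index of its nearest center.

    Distance is Euclidean (math.dist, Python 3.8+); ties go to the
    center with the lowest index.
    """
    return [
        min(range(len(centers)), key=lambda i: math.dist(point, centers[i]))
        for point in points
    ]


# Illustrative 2-D data: two obvious groups.
points = [(0, 0), (1, 0), (10, 10), (11, 10)]
centers = [(0, 0), (10, 10)]
labels = assign_to_nearest(points, centers)
print(labels)  # [0, 0, 1, 1]
```

In practice the centers are usually not given up front; iterative methods such as k-means alternate this assignment step with recomputing each center from its assigned points.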

Tags data mining

Data Science

Frequent Itemset Challenge in Data Mining

Post author By admin
Post date February 8, 2013

In Business Intelligence (and in data mining in general), a common need is to find the items that frequently appear together in a consumer basket.
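A naive way to see what this means is to count how often each pair of items appears in the same basket and keep the pairs that reach a chosen threshold. The sketch below does exactly that in Python; the function name `frequent_pairs`, the `min_support` parameter and the sample baskets are hypothetical, and brute-force pair counting is shown for clarity rather than as the method the full post uses.

```python
from collections import Counter
from itertools import combinations


def frequent_pairs(baskets, min_support):
    """Count item pairs across baskets; keep those seen >= min_support times.

    Each pair is stored as a sorted tuple so (a, b) and (b, a) are the same.
    """
    counts = Counter()
    for basket in baskets:
        # set() ignores duplicate items within one basket.
        for pair in combinations(sorted(set(basket)), 2):
            counts[pair] += 1
    return {pair: n for pair, n in counts.items() if n >= min_support}


# Illustrative baskets.
baskets = [
    {"bread", "milk"},
    {"bread", "milk", "eggs"},
    {"milk", "eggs"},
    {"bread", "butter"},
]
pairs = frequent_pairs(baskets, min_support=2)
print(pairs)  # {('bread', 'milk'): 2, ('eggs', 'milk'): 2}
```

Counting every pair grows quadratically with basket size, which is why larger datasets typically call for pruning approaches such as the Apriori algorithm.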

Tags analytics, data mining

Miscellaneous

Ethical issues of using employee monitoring software

Post author By admin
Post date February 5, 2013

Employee monitoring software has become commonplace. Many apps take screenshots, capture keystrokes and mouse movements, monitor active applications and visited sites and, in extreme cases, can even take pictures using a webcam. It seems fair to track what your employees do when they are being paid for their time. After all, if they exchange their time for money, the employer should be able to know what they are paying for. So why does it still feel morally inappropriate in some cases? The question is far from theoretical. If the wrong decision is made, a company may face lawsuits, a backlash from its employees, a drop in overall productivity (the opposite of what was intended) or damage to its image. Let’s review in more detail which employee monitoring practices can be considered valid and which should be avoided.