Being the biggest scraper itself, Google doesn’t like it when somebody scrapes it, which makes life difficult for Google scrapers. In this post I offer several hints on how to scrape Google in a safe way (if you have still decided to do it).
The LinkedIn API doesn’t allow you to publish into groups unless you are their administrator. That was done to eliminate spamming, but if you are a member of several groups on a similar topic and want to share some interesting information with all of them, you have to do it manually, group by group, and eventually it becomes tedious. In this post I’ll show you a simple way to automate this process in C# using Selenium WebDriver.
Choosing a provider is not an easy task; you always want to find something «cheap and cheerful». Quite often, though, it is hard to find a golden mean, and you have to choose between computing power, speed, and cost, not to mention additional features such as DNS servers, a control panel, etc. In this article, I will present test results for several providers of various sizes, and I hope it will guide you in the decision-making process of choosing a hosting provider.
This is a guest post by Daniel Cave.
With the rise of social media sharing and collaboration, and an increasingly interested market for data, there are more and more people wanting to ‘play with data’ and learn using some basic free tools. So recently I’ve been trying to find a technically advanced and interesting combination of free tools for collecting and visualising web data that will give enthusiasts and students those all-important initial quick and easy wins.
I have already written several articles on how to use Selenium WebDriver for web scraping, and all those examples were for Windows. But what if you want to run your WebDriver-based scraper somewhere on a headless Linux server, for example on a Virtual Private Server with SSH-only access? Here I will show you how to do it in several simple steps.
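As a rough illustration of what such a headless setup typically involves, here is a minimal sketch using Xvfb, a virtual X display. The package names, display number, and scraper command are assumptions for illustration, not taken from the post itself:

```shell
# Hypothetical setup on a Debian/Ubuntu VPS (assumed package names).
# Xvfb provides a virtual framebuffer so a real browser can run without a monitor.
sudo apt-get update
sudo apt-get install -y xvfb firefox

# Start a virtual display :99 and point graphical programs at it.
Xvfb :99 -screen 0 1280x1024x24 &
export DISPLAY=:99

# With DISPLAY set, a WebDriver-based scraper can launch the browser
# as if a desktop were attached, e.g.:
#   mono MyScraper.exe        # placeholder for your own scraper binary
```

The same effect can often be achieved with the `xvfb-run` wrapper, which starts and stops the virtual display around a single command.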
Import•io is a big data cloud platform that has the ambitious goal of turning the web into a database. It was founded in March 2012, and a year later it received $1.3M in seed funding from Wellington Partners, Louis Monier and Emmanuel Javal.
Recently I published an article on how to solve captcha in C# using DeathByCaptcha service, and I promised to offer you an example in other languages as well. In this post I’ll offer a Java project that does the same thing.
If you develop an application for web scraping, it would be really nice to upgrade it with automatic CAPTCHA recognition. The “Bypass CAPTCHA” service allows you to do this very easily, since its focus is on use in third-party software. In this post I’ll show you how easy it is to extend your scraper using this service.
Let’s look at a practical example of how to solve CAPTCHAs using the DeathByCaptcha service. This example is written in C#, but you can get it in Java as well.
GSA Captcha Breaker is CAPTCHA-solving software. It uses Optical Character Recognition algorithms for CAPTCHA decoding. Being a standalone program, it works independently of any online CAPTCHA recognition services (such as DeathByCaptcha or BypassCaptcha). This means that once you have paid for the program, you no longer need to pay for each recognition, which saves money when you need to recognize a huge number of CAPTCHAs.