Since I’ve now completed the series on website analytics and data analysis, I want to sum up these posts, most of them pertaining to analytics performed with Google spread computing.
Google has been the pioneer in using distributed computation algorithms. It has accumulated statistics and analytics as to web data since 2004. So, we are now the beneficiaries of this mass data content and spread computation techniques. Below I briefly mention the tools (all of them are free) that contribute to modern data analysis.
Google Correlate is the tool for finding words/search terms from the Google warehouse which match a given real-world time trend. Google Correlate works in reverse to Google Trends, the former (Google Correlate) illustrates tendencies for a specific word while the latter retrieves from the databank the keywords whose trends are similar to any given trend. The trend can be acquired either through a certain word trend or by inputting it from CSV or another kind of time series data file.
Google Refine is a handy tool for refining, cleaning up and structuring messy data. It’s of much use when post scrape raw data from several sources are to be processed. This can best be applied in working with often messy government data sets. Besides removing inconsistencies, this tool allows the augmentation of data with information from the Google friendly FreeBase database. Some other data mining methods are also present in this service.
Google Webmasters Tools is a necessary tool to gain insight on how your website is viewed and hit through organic search results. It’s excellent in monitoring what people preview in search results when running search queries. In addition, this tool is embeddable into Google Analytic, helping users to monitor not just the site’s traffic but each keyword search tendency and click through rate.
5. Google Analytics