Monday, May 24, 2010

Geoda, R, ArcMap

The choice of software to use depends mostly on the objective of the project. For example, one may be asked to create a map showing the distribution of population of the US voters. In this case ArcMap would be an ideal instrument to use. It can generate very informative maps. The following examples illustrate this point. Data from the past presidential votes (2004/8) have been analyzed to determine the density of votes using the Z scores and P values. The results have been successfully visualized in the form of a map, for each presidential candidate. This makes it to compare the data easily.
Z Scores and P values for the US presidential votes 2004/8

The Z scores and p-values are measures of statistical significance that tell you whether to reject the null hypothesis, feature by feature. They, in effect, indicate whether the apparent similarity (or dissimilarity) in values for a feature and its neighbors is greater than one would expect in a random distribution. The Z score is based on the Randomization Null Hypothesis computation. A high positive Z score for a feature indicates that the surrounding features have similar values (either high values or low values). A low negative Z score for a feature indicates a statistically significant (0.05 level) spatial outlier (http://webhelp.esri.com/arcgisdesktop/9.3/index.cfm?TopicName=Cluster_and_Outlier_Analysis:_Anselin_Local_Moran%27s_I_%28Spatial_Statistics%29).
Fig.1. Indicates the P - values and Z Scores for both Kerry and Bush during the 2004 votes


Fig.2. Indicates the P - values and Z Scores for both McCain and Obama during the 2008 votes

Geoda, on the other hand is a very useful tool when it comes to visualizing the correlation between sample. The following figures demonstrate the usefulness of this tool (Geoda).



Fig.3 Moran's plot for Bush


Fig 4. Moran's plot for Kerry


Fig 5. Moran's plot for Obama


Fig 6. Moran's plot for McCain

R, a statistical language is very useful and powerful when it comes to data analysis. Here, in most cases, the user is limited by his/her imagination. The user has the liberty to choose any kind of plot to display the results. R can also be used to generate map, but it is not very good at that.

Conclusion

Depending on the objective of the project the user has liberty to choose any programme to analyze and visualize data.

No comments:

Post a Comment