The reason I question the assertion that poverty correlates with crime is that when I plot poverty rates for states vs homicides rates I don't see a correlation. You can check the following data for yourself. When there is a correlation the data points should more or less follow a straight line or at least a curved line. But when I plot this data I get something that looks like a shotgun blast, points all over the place. Now it would probably be more accurate to look at city level or county level data. That takes much more time than even a retired geezer would care to spend. But at least at the state level I just don't see a correlation. Let me know if you disagree.
[data]
Points in statistical analysis very, very rarely follow anything that actually looks like a straight line. You need a correlation coefficient (more on these in a bit) very close to either +1 or -1 for that to happen. More often than not, it
does look like a "shotgun blast", as you put it. It often is not immediately obvious whether or not a correlation exists. However, there are methods of analysis that attempt to find a method in the madness.
The first is the method of correlation coefficients, which I mentioned above. A correlation coefficient, as I said above, is essentially a measure of how correlated two variables are. Given random variables
X and
Y, the correlation coefficient
r is given by
where E(
X) is the expected value of random variable
X, μ
X is the mean value of random variable
X and σ
X is the standard deviation of random variable
X (essentially, how far away from the mean most values of the random variable are). It would take a while to get into all of the details, but it suffices to say that the correlation coefficient is a measure of how related
X and
Y are to each other. It can be anywhere between -1 (a perfect negative correlation) and +1 (a perfect positive correlation). A value
r = 0 would indicate that there is no relation between the variables at all.
Now, we can't calculate
r as above because that's applicable to two random variables, not to two sets of sample data. However, if you go through the process of calculating the correlation coefficient for two sets of sample data (the formula is much like the one above), then you would get that
r = 0.48: not a perfect correlation by any means, but certainly greater than 0, which is what one would expect the correlation coefficient to be close to were there no correlation between the variables at all. Uncorrelated random variables very rarely yield sets of sample data with a sample correlation coefficient greater than 0.2 or less than -0.2.
Another method would be the process of hypothesis testing, but unfortunately, I really should be heading to bed, so I'll have to do this section at a later date. It's safe to say, however, that those data are indeed, in fact, somewhat correlated. It's not a perfect correlation by any means because there obviously would be other factors contributing to homicide rates (I should also note that homicide rates are also not all the crime that might occur due to poverty), but a correlation is actually there, even if it looks like one big mess.