Learning Likely Locations

We show that people’s travel destinations can be predicted from simple features of their home and destination. Using geotagged Twitter data from over 200,000 people in the U.S., with a median of 10 visits per user, we use machine learning to classify whether a person will visit a given location. We find that travel distance is the most important predictive feature. Ignoring distance and using only demographic features pertaining to race, age, income, land area, and household density, we can predict travel destinations with 84% accuracy. We present a careful analysis of the predictive power of individual and grouped demographic features to show which have the greatest impact on where people go.
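As a rough illustration of the setup described above, the sketch below builds one feature row for a (home, destination) pair, with travel distance as an optional feature so that it can be left out in the demographics-only setting. It is only a sketch under stated assumptions: the abstract does not say how distance or the feature rows were computed, so the haversine (great-circle) formula, the function names `haversine_km` and `pair_features`, and the dictionary layout are all hypothetical choices made for this example.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius, used by the haversine formula


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two points given in degrees.

    Assumption: the source does not state how travel distance was measured;
    haversine is used here only as a common, simple choice.
    """
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    # Clamp to 1.0 to guard against floating-point overshoot before asin.
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(min(1.0, a)))


def pair_features(home, dest, use_distance=True):
    """Build one feature row for a (home, destination) pair.

    `home` and `dest` are hypothetical dicts with keys 'lat', 'lon', and
    'demo' (a dict of demographic values for that location).
    """
    row = {}
    for key in sorted(home["demo"]):
        row["home_" + key] = home["demo"][key]
    for key in sorted(dest["demo"]):
        row["dest_" + key] = dest["demo"][key]
    if use_distance:
        row["distance_km"] = haversine_km(
            home["lat"], home["lon"], dest["lat"], dest["lon"]
        )
    return row
```

Rows built this way, each labeled by whether the person visited the destination, could be passed to any binary classifier. Calling `pair_features(home, dest, use_distance=False)` yields the distance-free variant of the same row.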