Home-sharing Services Data Analysis for Detection of Possible Human Trafficking
Project scope
Categories
Data analysis Information technologySkills
python analytic problem solving modelling machine learning big data
We are looking to gain insight into any emerging trends regarding suspicious activity in home-sharing listings related to fraud, human sex trafficking or any associated illegal activity in Toronto.
Airbnb data is the largest repository of home-sharing data currently available. It is posted publicly at this site: http://insideairbnb.com/get-the-data.html Please use only the Toronto data. You may use any amount of years and any / all files provided including any other relevant open data sources.
Additionally, you may reach out to the website contributors at this email: murray@murraycox.com to get information on deleted postings, or other information which may also be of interest. A useful open data source to consider incorporating in analysis may be the Major Crime Indicators data, available on the Toronto Police Open Data Portal: http://data.torontopolice.on.ca/
Please post all of your code to a GitHub page and provide us with the link to both the consolidated data source(s) you use and any code you create. All of your results must be reproducible and code must run without errors. Please provide us with a 1 page executive summary of key findings and detailed report of up to 10 pages outlining your findings, recommendations, and next steps for further investigation.
Some possible starting points are:
- NLP/ text classification of reviews to flag any concerning comments, especially recurring ones
- Detecting / predicting fraudulent postings
- GIS analysis of locations of postings, in relation to Toronto neighborhood crime data
About the company
To Serve and Protect.
“We are dedicated to delivering police services, in partnership with our communities, to keep Toronto the best and safest place to be.”
Core Values:
- Service at our Core
- Do the right thing
- Connect with Compassion
- Reflect and Grow