Advanced Crime Analysis
Data Science for Crime Science
What is this? and Why do we need it?
New research questions!
New ways to solve problems!
Also: uncomfortable!!
–> Lecture 7 + 8
https://www.theverge.com/2018/2/27/17054740/palantir-predictive-policing-tool-new-orleans-nopd
https://www.theverge.com/2018/5/22/17379968/amazon-rekognition-facial-recognition-surveillance-aclu
https://www.theverge.com/2018/5/24/17391632/amazon-facial-recognition-orlando-police-rekognition
–> Lecture 9
https://hbr.org/2018/07/want-less-biased-decisions-use-algorithms
–> Lecture 9
http://firstmonday.org/ojs/index.php/fm/article/view/7126/6522
–> Lecture 7, 8, 9
https://www.nature.com/articles/d41586-018-05285-9
–> Lecture 4 + 5
–> Lecture 4 + 5
Data Science Wild West
You’ll learn to tell hype from promise!
Principle 1: There’s no magic in Data Science
Principle 2: Golden data never comes in a spreadsheet
Principle 3: Data treasures are hidden in front of you
All names of current FBI most wanted terrorists?
Let’s start here: https://www.fbi.gov/wanted/terrorism
## Loading required package: xml2
## [1] "SHAYKH AMINULLAH"
## [2] "FAKER BEN ABDELAZZIZ BOUSSORA"
## [3] "ABDULLAH AL-RIMI"
## [4] "IBRAHIM SALIH MOHAMMED AL-YACOUB"
## [5] "RAMADAN ABDULLAH MOHAMMAD SHALLAH"
## [6] "ABDELKARIM HUSSEIN MOHAMED AL-NASSER"
## [7] "ALI ATWA"
## [8] "ABDUL RAHMAN YASIN"
## [9] "HUSAYN MUHAMMAD AL-UMARI"
## [10] "ALI SAED BIN ALI EL-HOORIE"
## [11] "ABD AL AZIZ AWDA"
## [12] "AHMAD IBRAHIM AL-MUGHASSIL"
## [13] "JABER A. ELBANEH"
## [14] "JAMEL AHMED MOHAMMED ALI AL-BADAWI"
## [15] "MOHAMMED ALI HAMADEI"
## [16] "AYMAN AL-ZAWAHIRI"
## [17] "AHMAD ABOUSAMRA"
## [18] "ADNAN G. EL SHUKRIJUMAH"
## [19] "ABDERRAOUF JDEY"
## [20] "RADDULAN SAHIRON"
## [21] "JEHAD SERWAN MOSTAFA"
## [22] "LIBAN HAJI MOHAMED"
## [23] "LEO FREDERICK BURT"
## [24] "ISHMAIL MUSLIM ALI"
## [25] "JOSE ESPINOSA CABALLERO"
## [26] "EDUARDO GUERRA JIMENEZ"
## [27] "AMBROSE HENRY MONTFORT"
## [28] "JEAN-PIERRE CHARETTE"
## [29] "ALAIN ALLARD"
## [30] "HASAN IZZ-AL-DIN"
## [31] "SIRAJUDDIN HAQQANI"
## [32] "AMER EL-MAATI"
## [33] "GEORGE EDWARD WRIGHT"
## [34] "MUHAMMAD AHMED AL-MUNAWAR"
## [35] "MUHAMMAD ABDULLAH KHALIL HUSSAIN AR-RAHAYYAL"
## [36] "WADOUD MUHAMMAD HAFIZ AL-TURKI"
## [37] "JAMAL SAEED ABDUL RAHIM"
## [38] "ABDULLAH AHMED ABDULLAH"
## [39] "SAIF AL-ADEL"
## [40] "GHAZI NASR AL-DIN"
library(rvest)
target_page = read_html('https://www.fbi.gov/wanted/terrorism')
target_page %>%
html_nodes('p.name') %>%
html_text()
## [1] "SHAYKH AMINULLAH"
## [2] "FAKER BEN ABDELAZZIZ BOUSSORA"
## [3] "ABDULLAH AL-RIMI"
## [4] "IBRAHIM SALIH MOHAMMED AL-YACOUB"
## [5] "RAMADAN ABDULLAH MOHAMMAD SHALLAH"
## [6] "ABDELKARIM HUSSEIN MOHAMED AL-NASSER"
## [7] "ALI ATWA"
## [8] "ABDUL RAHMAN YASIN"
## [9] "HUSAYN MUHAMMAD AL-UMARI"
## [10] "ALI SAED BIN ALI EL-HOORIE"
## [11] "ABD AL AZIZ AWDA"
## [12] "AHMAD IBRAHIM AL-MUGHASSIL"
## [13] "JABER A. ELBANEH"
## [14] "JAMEL AHMED MOHAMMED ALI AL-BADAWI"
## [15] "MOHAMMED ALI HAMADEI"
## [16] "AYMAN AL-ZAWAHIRI"
## [17] "AHMAD ABOUSAMRA"
## [18] "ADNAN G. EL SHUKRIJUMAH"
## [19] "ABDERRAOUF JDEY"
## [20] "RADDULAN SAHIRON"
## [21] "JEHAD SERWAN MOSTAFA"
## [22] "LIBAN HAJI MOHAMED"
## [23] "LEO FREDERICK BURT"
## [24] "ISHMAIL MUSLIM ALI"
## [25] "JOSE ESPINOSA CABALLERO"
## [26] "EDUARDO GUERRA JIMENEZ"
## [27] "AMBROSE HENRY MONTFORT"
## [28] "JEAN-PIERRE CHARETTE"
## [29] "ALAIN ALLARD"
## [30] "HASAN IZZ-AL-DIN"
## [31] "SIRAJUDDIN HAQQANI"
## [32] "AMER EL-MAATI"
## [33] "GEORGE EDWARD WRIGHT"
## [34] "MUHAMMAD AHMED AL-MUNAWAR"
## [35] "MUHAMMAD ABDULLAH KHALIL HUSSAIN AR-RAHAYYAL"
## [36] "WADOUD MUHAMMAD HAFIZ AL-TURKI"
## [37] "JAMAL SAEED ABDUL RAHIM"
## [38] "ABDULLAH AHMED ABDULLAH"
## [39] "SAIF AL-ADEL"
## [40] "GHAZI NASR AL-DIN"
1
00:00:00,000 --> 00:00:01,829
although there's no hard evidence<font color="#E5E5E5"> to</font>
2
00:00:01,829 --> 00:00:03,419
support Warren's claim of Native
3
00:00:03,419 --> 00:00:06,060
American<font color="#E5E5E5"> ancestry she has cited family</font>
4
00:00:06,060 --> 00:00:09,990
<font color="#E5E5E5">lore and not just a stray remarks about</font>
5
00:00:09,990 --> 00:00:12,750
her cheekbones<font color="#E5E5E5"> like that could be in the</font>
6
00:00:12,750 --> 00:00:14,190
<font color="#CCCCCC">onion</font><font color="#E5E5E5"> that literally could be in the</font>
7
00:00:14,190 --> 00:00:18,390
onion<font color="#E5E5E5"> all right</font><font color="#CCCCCC"> this is a special</font>
But: I don’t need this “programming” for this!
Matter of volume
More on learning outcomes in the module handbook
Build your own machine learning models to predict whether a news article is fake or not
your own capstone project
Teaching assistant: Felix Soldner
Homework for today:
Tomorrow’s tutorial: “WTF!? session”
Next week: Web scraping