FINAL CONFERENCE
SICH PROJECT: Semantic Illegal Content Hunter
The SICH Project
Co-funded by the Prevention of and Fight against Crime Programme of the European Union
Alessandro Capone – Expert System
20th November 2015, Rome
SICH: a brief introduction
Project co-financed by EU DG HOME AFFAIRS
ISEC PROGRAMME 2012
Prevention of and fight against crime – Illegal use of internet
PRIORITY CALL: Illegal Use of Internet (INT)
EXPECTED RESULTS: Facilitating the taking down of illegal Internet content through public-private cooperation, or blocking access to child pornography or to illegal Internet content through public-private cooperation
PROJECT DURATION: from 1st September 2013 to 30th November 2015
PARTNERS: + Advisory Board
WEBSITE: www.sichproject.eu
Expert System
Proven Technology:
• Award-winning, patented semantic technology
• Global company, listed on the AIM exchange
• 3 of the world’s leading Oil & Gas companies are our customers
• 90% of customers extend their license to other domains
• Millions of global banking customers use our software
• Top US and European federal agencies work with us
Expert System
We make information more findable & intelligent to help your business be more strategic & efficient
• Information management, access and dissemination
• Collaboration & Innovation
• Social media & Marketing optimization
• Competitive intelligence & Information security
• Customer Experience
RiSSC
RiSSC is a non-profit research centre (association), founded in 2005, that deals with security and crime.
RiSSC aims to contribute to social and cultural development in crime prevention through research activities, education and training initiatives, and technical assistance projects on the most relevant criminal phenomena and their trends, on the causes and factors that facilitate crime and anti-social behaviour, and on countermeasures to prevent or reduce both criminal opportunities and criminal impact.
It involves a network of experts and researchers who contribute to a multi-disciplinary approach.
www.rissc.it
Why SICH?
At present, LEAs, research institutions, the media and international organizations that study illegal online content may encounter great difficulty in collecting and managing the mass of documents and content related to a specific topic or crime.
For these reasons, tools that support analysts in identifying, selecting and analysing illegal online content can drastically shorten the learning curve for a phenomenon: tools that can, in effect, "hunt" illegal content hidden within the plethora of textual content on the Web.
SICH Objectives
OVERALL OBJECTIVE
The overall objective of the Project is to develop an innovative model, merging novel, specific ontologies and semantic technology, to help analysts (from LEAs and public-private target groups) identify and take down illegal online content
SPECIFIC OBJECTIVES
• to define an experimental model on three domains: Xenophobia/Racism, Online Illegal Gambling and Novel Psychoactive Substances (NPS)
• to define a specific ontology for each project domain and to develop implementation tools, based on semantic analysis of texts, that facilitate the identification of targets in large volumes of unstructured online information from open sources
• to transfer the Project results to target groups at EU level
SICH Beneficiaries
• EU Member States' LEAs (including Europol and Interpol)
• NGOs and all those organizations (private or not) involved in the fight against crime on the Internet (hotlines, helplines, Safer Internet Centres, Cybercrime Centres of Excellence Network, etc.)
• experts and researchers in the scientific community who work on the project's issues
• media that want to cover complex issues with limited time available
AGENDA
09.00 – 09.30 Welcome and Registration
09.30 – 10.10 The SICH Project: Presentation
Alessandro Capone – Expert System
Crime 2.0: Internet as a crime facilitator. The cases of Novel Psychoactive Substances (NPS) and of hate crimes motivated by racism and xenophobia (online hate speech)
10.10 – 10.45 Cybercrime: old and new scenarios
Gianluigi Me – Università LUISS, member of the SICH Project Advisory Board
10.45 – 11.15 The growth and spread of Novel Psychoactive Substances
Roberta Pacifici – Istituto Superiore di Sanità
11.15 – 11.45 Coffee Break
11.45 – 12.25 The fight against cybercrime: case histories from European experts
INTERPOL’s Investigative Support to the Fight against NPS
Daoming Zhang – Interpol
Fighting xenophobia, racism and right-wing violent extremist content on the Internet
Insp. Sara Bento – Polícia Judiciária (Portugal)
12.25 – 12.40 Open Discussion: Q&A
12.40 – 13.40 Lunch
13.40 – 14.10 A search engine optimized for discovering illegal content
Luigi Laura – Università La Sapienza
14.10 – 15.10 SICH FOCUS: Using the SICH platform to analyse NPS and online hate speech on the Internet
Alessandro Capone – Expert System
15.10 – 15.25 Open Discussion: Q&A
15.25 – 15.55 Coffee break
15.55 – 16.40 A look towards the future: SICH’s legacy
IANCIS and ISODAC EU Projects
Massimo Bernaschi – CNR IaC
EPS/NPS EU Project
Elisabetta Bosio – RiSSC
Geographical analysis as a new tool for preventing and monitoring the distribution of NPS
Michele Ieradi – ESRI Italia
Conclusions
Alessandro Capone – Expert System
16.40 – 17.30 Practical sessions on using the platform
Contacts
Thank you
Alessandro Capone
Technical Account Manager, Expert System
FINAL CONFERENCE
SICH PROJECT: Semantic Illegal Content Hunter
SICH FOCUS: Using the SICH platform to analyse NPS and online hate speech on the Internet
Alessandro Capone – Expert System
20th November 2015, Rome
Why SICH?
The volume and diversity of sources make information difficult to manage:
• Multiple knowledge bases
• Documents of various formats and email
• Websites, blogs, social media and open sources
The common factor is a combination of very specific and generic text. A lot of it.
The SICH system
The ESCrawler is in charge of:
• downloading documents from the Web;
• searching for and extracting parts of documents;
• generating documents composed of different documents;
• filtering out non-core parts of documents;
• populating HTML forms and retrieving the results;
• classifying documents, or parts of them, by calculating a hash signature.
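The last crawler task, classifying by hash signature, can be illustrated with a minimal sketch. The slides do not say which hash function or normalization the ESCrawler uses; SHA-256 over lowercased, whitespace-collapsed text is an assumption made here only for illustration, and the function names are hypothetical.

```python
import hashlib
import re


def hash_signature(text: str) -> str:
    """Return a hex signature for a document or fragment (assumed: SHA-256)."""
    # Lowercase and collapse whitespace so trivial formatting changes
    # do not alter the signature. This normalization is an assumption.
    normalized = re.sub(r"\s+", " ", text).strip().lower()
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def group_by_signature(fragments: list[str]) -> dict[str, list[str]]:
    """Group fragments that share the same signature."""
    groups: dict[str, list[str]] = {}
    for fragment in fragments:
        groups.setdefault(hash_signature(fragment), []).append(fragment)
    return groups
```

An exact hash of this kind only groups fragments that are identical after normalization; recognising near-duplicates would need a different technique.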
Discovery & Categorization by
• Parser
• Semantic Network
• Lexicon
• Knowledge Base
• Memory
Semantic Index
Conceptual Map
• linguistic search
• spatial search
• events-based search
• search based on corpora selection
Semantic Search;
Text Mining;
Automatic Categorization;
Data Intelligence.
The SICH system
[Diagram: Web → Crawler → COGITO® (Entity Extraction, Categorization, Text Mining) → Indexer → Semantic Index]
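The flow from crawled pages through analysis to an index can be sketched in a few lines. This is a toy illustration, not the COGITO® analysis: the category lexicon, the whitespace tokenizer and the example URLs are all hypothetical stand-ins.

```python
from collections import defaultdict

# Hypothetical stand-in for the analysis stage: a tiny category lexicon.
CATEGORY_TERMS = {
    "NPS": {"mdma", "ecstasy"},
    "Gambling": {"casino", "betting"},
}


def analyze(text: str) -> set[str]:
    """Return the categories whose terms appear in the text."""
    tokens = set(text.lower().split())
    return {cat for cat, terms in CATEGORY_TERMS.items() if tokens & terms}


def build_index(pages: dict[str, str]) -> dict[str, set[str]]:
    """Map each category to the URLs of the pages assigned to it."""
    index: dict[str, set[str]] = defaultdict(set)
    for url, text in pages.items():
        for category in analyze(text):
            index[category].add(url)
    return dict(index)


# Hypothetical crawled pages, keyed by URL.
pages = {
    "http://example.org/a": "Cheap ecstasy shipped worldwide",
    "http://example.org/b": "Online casino bonus",
    "http://example.org/c": "Weather forecast",
}
index = build_index(pages)
```

Pages that match no category, such as the weather page, simply do not appear in the index.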
The SICH system
[Diagram: the Searcher queries the Semantic Index by Keywords, Concepts, Entities, Categories, Corpora and Sources, Geo, and Alerts]
Value of understanding
What does it mean?
What is the context?
What is the sentiment?
Why is it important?
“Increasing volumes, variety and velocity – big data – in Information Management and business operations, requires semantic technology that makes sense out of data for humans, or automates decisions.”
Source: Gartner Identifies Top Technology Trends Impacting Information Infrastructure in 2013
The power of language
Technology alone struggles to address language ambiguities or understand context…
• Same word, different meanings: "press" (push, or the news media?)
• Different words, same meaning: "buy" or "purchase"?
• Different words, related meaning: "Hollywood" or "the U.S. film industry"?
Semantic approach
The Cogito® semantic technology is software for the automatic analysis and understanding of text.
Unlike other technologies, Cogito does not process content as a sequence of characters and does not guess at the meaning of words and concepts.
Instead, Cogito relies on deep semantic analysis and a rich semantic network to understand a text as a person would.
Cogito® Technology
What makes the technology unique
Our semantic network is a rich map of definitions of words and associations between words.
• 2 million concepts and all their different meanings
• 6 million relationships between these meanings
Example
Searching for the word “MDMA”, the SICH system returns a set of documents containing words (or sequences of characters) such as “ecstasy”, “disco biscuit”, “methylenedioxymethamphetamine” and “love drug”. In the NPS domain, the concept, as an aggregation of different words or lemmas with the same meaning, is crucial, because blacklists are continuously updated and new NPS keep appearing in e-shops and forums.
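The idea of searching by concept rather than by keyword can be sketched as follows. The concept table and function names are hypothetical, and the naive substring matching stands in for the lemma-based semantic analysis the slides describe; the surface forms are the ones listed in the example above.

```python
from typing import Optional

# Hypothetical concept table: one concept, many surface forms.
CONCEPTS = {
    "MDMA": {
        "mdma",
        "ecstasy",
        "disco biscuit",
        "methylenedioxymethamphetamine",
        "love drug",
    },
}


def concept_for(term: str) -> Optional[str]:
    """Return the concept a term belongs to, or None if it is unknown."""
    needle = term.lower()
    for concept, forms in CONCEPTS.items():
        if needle in forms:
            return concept
    return None


def search(term: str, documents: list[str]) -> list[str]:
    """Return documents mentioning any surface form of the term's concept."""
    concept = concept_for(term)
    # Unknown terms fall back to a plain keyword match.
    forms = CONCEPTS[concept] if concept else {term.lower()}
    return [doc for doc in documents if any(f in doc.lower() for f in forms)]
```

With this table, searching for "MDMA" or for "ecstasy" returns the same documents, and adding a new street name to the table extends every query for that concept without changing the queries themselves.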
Criminology
Criminology:
• qualitative analysis of phenomena
• identification of patterns and trends
• analysis of criminal structures (e.g. individuals, groups of ethnic origin, organised criminal groups ...)
• analysis of the role of technology as a crime facilitator
Criminology & Semantic Analysis
Contribution of criminology to the semantic approach:
• desk research / literature analysis
• identifying relevant open sources
• identifying specific terminology
• support in the analysis of results and in the re-use of information for research and investigation
NPS
• “a new narcotic or psychotropic drug, in pure form or in preparation, that is not controlled by the 1961 United Nations Single Convention on Narcotic Drugs or the 1971 United Nations Convention on Psychotropic Substances, but which may pose a public health threat comparable to that posed by substances listed in these conventions” (Council Decision 2005/387/JHA)
• emerging phenomenon since the early 2000s, evolving both offline and online
NPS
• online NPS market: 4 main business segments
1. e-shops that sell NPS as chemicals, often using their correct chemical names;
2. e-shops that sell NPS under their brand names;
3. classified advertising banners, often on public sites;
4. deep web (e.g. Silk Road, Evolution, Agora ...)
Hate speech
It includes all forms of expression that spread, incite, promote or justify racial hatred, xenophobia, anti-Semitism or other forms of hatred based on intolerance, including: intolerance expressed by aggressive nationalism and ethnocentrism, discrimination and hostility against minorities, migrants and people of immigrant origin
Hate speech
• Categories of interest for the project:
1. crimes related to race
2. crimes related to religion
3. crimes related to sexual orientation
• all these categories are seeing a rise in cases, driven by the growing use of social networks for xenophobic and racist purposes
• communication is textual, but also figurative
The SICH interface
LIVE DEMO
Contacts
Thank you
Alessandro Capone
Technical Account Manager, Expert System
FINAL CONFERENCE
SICH PROJECT: Semantic Illegal Content Hunter
Conclusions
Alessandro Capone – Expert System
20th November 2015, Rome
Conclusions
• SICH could have a dramatic impact on crime analysis in complex Web-based domains, where time-to-result is negligible compared with the time needed for information discovery
• Automatic information discovery is the only way to quickly understand complex, rapidly changing phenomena
• SICH could be the first step on a long path
FINAL CONFERENCE
SICH PROJECT: Semantic Illegal Content Hunter
Practical Sessions
20th November 2015, Rome
Practical Sessions
• Website: http://sich.intelligenceplatform.net