Your browser doesn't support javascript.
loading
Piracema: a Phishing snapshot database for building dataset features.
Gomes de Barros, Julio Cesar; Revoredo da Silva, Carlo Marcelo; Candeia Teixeira, Lucas; Torres Fernandes, Bruno José; Lorenzato de Oliveira, Joao Fausto; Luzeiro Feitosa, Eduardo; Pinheiro Dos Santos, Wellington; Ferraz Arcoverde, Henrique; Cardoso Garcia, Vinicius.
Afiliação
  • Gomes de Barros JC; Escola Politécnica de Pernambuco (POLI), Universidade de Pernambuco (UPE), Recife, PE, 50720-001, Brazil.
  • Revoredo da Silva CM; Escola Politécnica de Pernambuco (POLI), Universidade de Pernambuco (UPE), Recife, PE, 50720-001, Brazil. cmrs@poli.br.
  • Candeia Teixeira L; Escola Politécnica de Pernambuco (POLI), Universidade de Pernambuco (UPE), Recife, PE, 50720-001, Brazil.
  • Torres Fernandes BJ; Escola Politécnica de Pernambuco (POLI), Universidade de Pernambuco (UPE), Recife, PE, 50720-001, Brazil.
  • Lorenzato de Oliveira JF; Escola Politécnica de Pernambuco (POLI), Universidade de Pernambuco (UPE), Recife, PE, 50720-001, Brazil.
  • Luzeiro Feitosa E; Instituto de Computação (IComp), Universidade Federal do Amazonas (UFAM), Manaus, AM, 69080-900, Brazil.
  • Pinheiro Dos Santos W; Departamento de Engenharia Biomédica (DEBM), Universidade Federal de Pernambuco (UFPE), Recife, PE, 50740-560, Brazil.
  • Ferraz Arcoverde H; Centro de Informática (CIn), Universidade Federal de Pernambuco (UFPE), Recife, PE, 50740-560, Brazil.
  • Cardoso Garcia V; Centro de Informática (CIn), Universidade Federal de Pernambuco (UFPE), Recife, PE, 50740-560, Brazil.
Sci Rep ; 12(1): 15149, 2022 09 07.
Article em En | MEDLINE | ID: mdl-36071135
ABSTRACT
Phishing is an attack characterized by attempted fraud against users. The attacker develops a malicious page that is a trusted environment, inducing its victims to submit sensitive data. There are several platforms, such as PhishTank and OpenPhish, that maintain databases on malicious pages to support anti-phishing solutions, such as, for example, block lists and machine learning. A problem with this scenario is that many of these databases are disorganized, inconsistent, and have some limitations regarding integrity and balance. In addition, because phishing is so volatile, considerable effort is put into preserving temporal information from each malicious page. To contribute, this article built a phishing database with consistent and balanced data, temporal information, and a significant number of occurrences, totaling 942,471 records over the 5 years between 2016 and 2021. Of these records, 135,542 preserve the page's source code, 258,416 have the attack target brand detected, 70,597 have the hosting service identified, and 15,008 have the shortener service discovered. Additionally, 123,285 records store WHOIS information of the domain registered in 2021. The data is available on the website https//piracema.io/repository.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Software / Segurança Computacional Idioma: En Revista: Sci Rep Ano de publicação: 2022 Tipo de documento: Article País de afiliação: Brasil

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Software / Segurança Computacional Idioma: En Revista: Sci Rep Ano de publicação: 2022 Tipo de documento: Article País de afiliação: Brasil