OnScenes
  • OnScenes
  • News
  • Art
    • Music >
      • Album Review
    • Poetry
    • Film >
      • Filmmakers >
        • Movies
    • Theater >
      • TheaterMakers
  • Philosophy
  • PhiloFiction
  • Science&Technology
  • Economy
  • Media
    • Video
    • Audio
  • About
  • Contact
    • Location

The Illicit Trade of Firearms, Explosives and Ammunition on the Dark Web

10/26/2017

0 Comments

 
by Himanshu Damle
Picture
DATACRYPTO is a web crawler/scraper class of software that systematically archives websites and extracts information from them. Once a cryptomarket has been identified, DATACRYPTO is set up to log in to the market and download its contents, beginning at the web page fixed by the researchers (typically the homepage). After downloading that page, DATACRYPTO parses it for hyperlinks to other pages hosted on the same market and follows each, adding new hyperlinks encountered, and visiting and downloading these, until no new pages are found. This process is referred to as web crawling. DATACRYPTO then switches from crawler to scraper mode, extracting information from the pages it has downloaded into a single database.
loading...
One challenge connected to crawling cryptomarkets arises when, despite appearances to the contrary, the crawler has indexed only a subset of a marketplace’s web pages. This problem is particularly exacerbated by sluggish download speeds on the Tor network which, combined with marketplace downtime, may prevent DATACRYPTO from completing the crawl of a cryptomarket. DATACRYPTO was designed to prevent partial marketplace crawls through its ‘state-aware’ capability, meaning that the result of each page request is analysed and logged by the software. In the event of service disruptions on the marketplace or on the Tor network, DATACRYPTO pauses and then attempts to continue its crawl a few minutes later. If a request for a page returns a different page (e.g. asking for a listing page and receiving the home page of the cryptomarket), the request is marked as failed, with each crawl tallying failed page requests.
DATACRYPTO is programmed for each market to extract relevant information connected to listings and vendors, which is then collected into a single database:
  1. Product title;
  2. Product description;
  3. Listing price;
  4. Number of customer feedbacks for the listing;
  5. The country or region from which a vendor ships the product;
  6. The country or regions to which the vendor placing the listing is willing to ship.
loading...
DATACRYPTO is not the first crawler to mirror the dark web, but is novel in its ability to pull information from a variety of cryptomarkets at once, despite differences in page structure and naming conventions across sites. For example, “$…” on one market may give you the price of a listing. On another market, price might be signified by “VALUE…” or “PRICE…” instead.
​
Researchers who want to create a similar tool to gather data through crawling the web should detail which information exactly they would like to extract. When building a web crawler it is, for example, very important to carefully study the structure and characteristics of the websites to be mirrored. Before setting the crawler loose, ensure that it extracts and parses correct and complete information. Because the process of building a crawler-tool like DATACRYPTO can be costly and time consuming, it is also important to anticipate on future data needs, and build in capabilities to extract that kind of data later on, so no large future modifications are necessary.
Building a complex tool like DATACRYPTO is no easy feat. The crawler needs to be able to copy pages, but also stealthily get around CAPTCHAs and log itself in onto the TOR server. Due to their bulkiness, web crawlers can place a heavy burden on a website’s server, and are easily detected due to their repetitive pattern moving between pages. Site administrators are therefore not afraid to IP-ban badly designed crawlers from their sites.
The Illicit Trade of Firearms Explosives and Ammunition on the Dark Web
Taken from:
altexploit
0 Comments



Leave a Reply.

    Science&Technology

    All

    Archives

    March 2020
    February 2020
    October 2019
    September 2019
    March 2019
    February 2019
    January 2019
    August 2018
    May 2018
    January 2018
    December 2017
    November 2017
    October 2017

    RSS Feed

    Alexander Galloway - ARE ALGORITHMS BIASED?
    Alexander Galloway -A LIST OF QUALITIES
    Achim Szepanski - CELLULAR AUTOMATA AND MACHINE 4.0
    Achim Szepanski - DELEUZE/GUATTARIS DIAGRAM
    Achim Szepanski - GILBERT SIMONDON, HIGH FREQUENCY TRADING AND ECOTECHNOLOGY
    Achim Szepanski - PARANOIA MACHINES OF THE STATE
    David Beyer - The Future of Machine Intelligence
    David Roden - New Substantivism in Philosophy of Technology
    DEEP LEARNING LIBRARY
    Geoff Manaugh - The Ghost of Cognition Past, or Thinking Like An Algorithm
    Himanshu Damle - Acceleration in String Theory–Savdeep Sethi
    Himanshu Damle - Black Holes
    Himanshu Damle - Nomological Unification and Phenomenology of Gravitation
    Himanshu Damle - Superstrings as Grand Unifier
    Himanshu Damle - The Coming Swarm DDoS Actions, Hacktivism, and Civil Disobedience on the Internet
    Himanshu Damle - The Illicit Trade of Firearms, Explosives and Ammunition on the Dark Web
    Max Haiven - THE POLITICS OF AI-DRIVEN FINANCIALIZATION (INTERVIEW WITH MAX HAIVEN)
    McKenzie Wark - Blog-Post for Cyborgs
    Paul HANDLEY - Data swamped US spy agencies put hopes on artificial intelligence
    open culture - George Orwell Predicted Cameras Would Watch Us in Our Homes; He Never Imagined We’d Gladly Buy and Install Them Ourselves
    Open Culture - This Is Your Brain on Exercise
    Rouvroy/Stiegler - THE DIGITAL REGIME OF TRUTH: FROM THE ALGORITHMIC GOVERNMENTALITY TO A NEW RULE OF LAW
    Steven Craig Hickman - The Cosmology of Nick Land: Bataille, Gnosticism, and Contemporary Physics
    Steven Craig Hickman - Fear of Technology: Being Alone Together in the Machine
    Steven Craig Hickman - Philip K. Dick, William Gibson and Science Experiments: Information from the Future
    The Climate changes in the time of Haarp weather systems
Powered by Create your own unique website with customizable templates.
  • OnScenes
  • News
  • Art
    • Music >
      • Album Review
    • Poetry
    • Film >
      • Filmmakers >
        • Movies
    • Theater >
      • TheaterMakers
  • Philosophy
  • PhiloFiction
  • Science&Technology
  • Economy
  • Media
    • Video
    • Audio
  • About
  • Contact
    • Location