Command Line Usage Overview

You can run PATHspider from the command line. In order for the Observer to work, you will need permissions to capture raw packets from the network interface. You may also need elevated privileges when generating traffic using raw sockets or to modify the local TCP/IP stack. This will require you to use sudo or equivalent in order to run PATHspider if you are not logged in as the root user.

# pspdr --help
usage: pspdr [-h] [--verbose] COMMAND ...

PATHspider will spider the paths.

optional arguments:
  -h, --help  show this help message and exit
  --verbose   Enable verbose logging

Commands:
    filter    Pre-process a target list
    measure   Perform a PATHspider measurement
    observe   Passively observe network traffic
    test      Run the built in test suite

Spider safely!

Performing Active Measurement

PATHspider provides the “measure” command to perform active traffic generation and observation of that traffic for path transparency measurement. Based on the observations made, paths are assigned conditions such as ecn.connectivity.works indicating that the use of ECN does not cause connectivity impairment between the vantage point and the particular target.

It is possible to enable the output of flow records along with the derived observations using the --output-flows flag. This will generate considerably more output and so is disabled by default.

You may specify input and output files using flags, however by default these are set to be stdin and stdout and so you can, and are recommended to, use shell redirection instead. To see output as it is written to the file, you can pipe the output to tee to print it on the screen while also saving it to a file.

You will be required to set your interface name and PATHspider will not start if it detects that the chosen interface is not active.

# pspdr measure --help
usage: pspdr measure [-h] [-i INTERFACE] [-w WORKERS] [--input INPUTFILE]
                    [--output OUTPUTFILE] [--output-flows]
                    PLUGIN ...

optional arguments:
  -h, --help            show this help message and exit
  -i INTERFACE, --interface INTERFACE
                        The interface to use for the observer. (Default: eth0)
  -w WORKERS, --workers WORKERS
                        Number of workers to use. (Default: 100)
  --input INPUTFILE     A file containing a list of PATHspider jobs. Defaults
                        to standard input.
  --output OUTPUTFILE   The file to output results data to. Defaults to
                        standard output.
  --output-flows        Include flow results in output.

Plugins:
  The following plugins are available for use:

   dscp                Differentiated Services Codepoints
   h2                  HTTP/2
   evilbit             Evil bit connectivity testing
   mss                 TCP Maximum Segment Size
   tfo                 TCP Fast Open
   dnsresolv           Simple Input List DNS Resolver
   ecn                 Explicit Congestion Notification
   udpzero             UDP Zero Checksum

Spider safely!

Quickstart Example

You can run a small study using the ECN plugin and the included webtest.ndjson file to measure path transparency to ECN for a small selection of web servers and save the results in results.ndjson (ensure to change the interface name to match an active interface on your machine):

pspdr measure -i eth0 ecn </usr/share/doc/pathspider/examples/webtest.ndjson >results.ndjson

Note

If you’ve not installed PATHspider from apt, you will find the webinput.ndjson example input file in the examples folder of the source distribution.

Performing Passive Observation

PATHspider provides the “observe” command to perform passive traffic observation for path transparency measurement. In this version of PATHspider we do not attempt to determine path conditions during passive observation, and instead only output flow records. This may change in future versions of PATHspider.

You can list the available chains with --list-chains and then select any number of chains that you would like to use. It is recommended that you include the basic chain as this will add the IP addresses and port numbers to the flow records.

You may specify the output file using a flag, however by default this is set to be /dev/stdout and so you can, and are recommended to, use shell redirection instead. To see output as it is written to the file, you can pipe the output to tee to print it on the screen while also saving it to a file. You will be required to set your interface name and PATHspider will not start if it detects that the chosen interface is not active.

It is also possible to perform offline analysis of a PCAP file using the “observe” command. Instead of an interface name, pass the name of the pcap file to -i instead. The PCAP file must have a .pcap extension to be recognised.

usage: pspdr observe [-h] [--list-chains] [-i INTERFACE] [--output OUTPUTFILE]
                     [chains [chains ...]]

positional arguments:
  chains                Observer chains to use

optional arguments:
  -h, --help            show this help message and exit
  --list-chains         Prints a list of available chains
  -i INTERFACE, --interface INTERFACE
                        The interface to use for the observer. (Default: eth0)
  --output OUTPUTFILE   The file to output results data to. Defaults to
                        standard output.

Quickstart Example

You can observe network traffic passively to perform observations without actively generating traffic. In this case no input file is needed.

pspdr observe -i eth0 basic tcp ecn >results.ndjson

Data Formats

PATHspider uses newline delimited JSON (ndjson) for both the input and output format. The ndjson format gives flexibility in the contents of the data as different tests may require data to remain associated with jobs so that it can be present in the final output (the Alexa ranking of a webserver, for example), or used as part of the test (running tests against authoritative DNS servers and needing to know a domain for which the server should be authoritative).

Input Format

At a minimum, each job should contain an IP address in a dip field. Depending on the plugin in use, more details may be required. Refer to the documentation for the specific plugin for more information.

Output Format

For each job, the output JSON dictionary will contain the original job information, a computed path using available information and a set of conditions seen for the path as generated by the plugins.

With --output-flows enabled, PATHspider’s output will include an additional field in the JSON dictionary for each job containing an array of flow records, one for each configuration. Usually one record will be for the baseline (A) connection, and one for the experimental (B) connection. These JSON records contain the original job information, any information added by the connection functions and any information added by the Observer.

Additionally, internal information may be retained:

Key Description
config 0 for baseline, 1..n for experimental
spdr_state 0 = OK, 1 = TIMEOUT, 2 = FAILED, 3 = SKIPPED

For detail on the values in individual plugins, see the section for that plugin later in this documentation.