jejking / rprng   1.0.13

Apache License 2.0 GitHub

reactive-pseudo-random-number-generator service based on apache commons math and akka.

Scala versions: 2.12 2.11

rprng

A reactive pseudo-RNG web service built on commons math and akka

Run the server

You can start the web application interactively with sbt or an IDE. For now we assume that the server is running on localhost on port 8080.

sbt assembly will produce a fat JAR that can be run directly from the command line using java.

API

Getting bytes

You can obtain bytes in two different styles:

  • fixed size blocks
    • http://localhost:8080/byte/block will deliver 1024 bytes of pseudo-randomness
    • http://localhost:8080/byte/block/${blockSize} will deliver ${blockSize} bytes of pseudo-randomness. The path variable must be a positive integer greater than zero.
  • stream
    • http:localhost:8080/byte/stream will deliver a continuous stream of randomness in chunks of 1024 bytes until the connection is closed

In both cases, the content type is application/octet-stream.

Getting integers

You can also obtain collections of integers grouped in either lists or sets.

  • http://localhost:8080/int/list
  • http://localhost:8080/int/set

In both cases, the following query parameters may be specified:

  • size. The size of the collection to return. Must be a positive integer greater than zero. Defaults to 100.
  • count. The number of collections of the specified size to return. Must be a positive integer greater than zero. Defaults to 1.
  • min. The lowest bound (inclusive) of members of the collection(s) to return. Defaults to Java's Integer.MIN_VALUE. Must be an integer (in the Java sense).
  • max. The top bound (inclusive) of members of the collection(s) to return. Defaults to Java's Integer.MAX_VALUE. Must be an integer (in the Java sense). It must also (obviously) be greater than min and, if a set is being requested, the span between min and max must be greater than size. (Set shuffling is not implemented yet).

A JSON object is returned and the collections are mapped as JSON arrays under the key content.

Getting a Random PNG

You can also have a random PNG generated.

  • http://localhost:8080/png

You may supply the following query parameters:

  • width. Specifies the width of the PNG to generate. Must be a positive integer greater than zero. Defaults to 250.
  • height. Specifies the height of the PNG to generate. Must be a postiive integer greath than zero. Defaults to 250.

The PNG is very simple - it is truecolour with an Alpha Channel, with 8 bit depth, so four bytes per pixel, one for each of Red, Green, Blue and Alpha.

Configuration

Akka has not been specifically configured for the system except for some basic logging settings.

An Typesafe Config application.conf file has been supplied with some options for the port for the web listener, the number of RandomSourceActor instances to be created and the min and max times between actor reseeding.

The following optional environment variables can be set:

  • RPRNG_HTTP_PORT, the http port
  • RPRNG_ACTOR_COUNT, the number of RandomSourceActor instances to create
  • RPRNG_RESEED_MIN, the minimum duration to a reseeding, must be a Typesafe config duration.
  • RPRNG_RESEED_MAX, the maximum duration to a reseeding, must be a Typesafe config duration, and, longer than the minimum duration

Writing Randomness to stdout

If you want to assess the quality of randomness produced by the application, you can run RandomBytesToStandardOut which will start up an actor system and stream chunks of 512 bytes to standard out until the program is terminated.

This stream can be used, for example, as an input to programs such as dieharder.

First, create the fat jar using sbt assembly. Then, start the application and pipe its output to dieharder, for example:

java -cp rprng-assembly-1.0.13-SNAPSHOT.jar com.jejking.rprng.main.RandomBytesToStandardOut | dieharder -a -g 200

The usual caveats apply when using dieharder or similar tools to assess randomness. But when I've run it it appears to be quite reasonable.

Design Overview

RNG Core

The application is relatively simple in its design which is intended to provide two essential qualities:

  • it should be difficult to observe the running service and draw conclusions about the state of the underlying PRNGs and hence to predict future output
  • it should be reasonably performant and provide a robust, non-blocking service

At the heart of the system is the Rng trait which defines methods to obtain a random array of bytes and to reseed some underlying PRNG. There is currently just one regular implementation (CommonsMathRng), which wraps an instance of RandomGenerator from Apache Commons math. The web application uses the IsaacRandom as its underlying PRNG implementation.

As the Apache RandomGenerator instances are not thread-safe, the Rng instances are managed by the RngActor. The external API for the actor is simply RandomByteRequest messages which are responded to using Akka ByteStrings.

The actor not only provides thread-safety guarantees, it also takes responsibility for ensuring that the underlying RNG is properly seeded - and then reseeded at unpredictable intervals. The provision of seed is defined in the SecureSeeder trait and the standard implementation is SecureRandomSeeder which essentially wraps Java's SecureRandom.generateSeed() method which will (normally) delegate to a good quality source of randomness such as /dev/random. Secure seed is obtained during actor startup. The actor uses the Akka scheduling system to schedule reseeding within a configurable interval. As obtaining high quality entropy is blocking, obtaining seed is executed in another thread and the result sent, on completion, back to the actor.

The standard application setup creates a configurable number of RngActor instances on start up and puts them behind a RandomRouter instance which helps to further obfuscate the overall observable state of the PRNGs.

HTTP Layer

Now, the Akka HTTP API which we are using to expose the service to the web, is streams based. It therefore makes sense to expose the RNGs as a stream. To that effect requests create ByteStringSource instances which interact with the actor router and expose a type-safe stream of ByteString instances of the requested size. In order to create Ints and other types there is a further wrapper class EightByteString (for eight bytes, aka 64 bits) and conversion functions exposed on EightByteStringOps.

The stream processing that serves the HTTP API is in the AkkaRoutingHelper. The class ToSizedSet, a custom GraphStage, allows sets of integers of a given size to be produced from a stream of integers.

JSON mapping of RandomIntegerCollectionResponse instances is carried out simply with Spray JSON.

PNG

The PNG implementation is built in a completely streaming fashion and thus obviates the need to work the Java ImageIO API and BufferedImage. It contains exactly what is needed to generated (pseudo)-random PNGs and is hence a long way from being a PNG library. These are Truecolor RGBA of 8 bit depth and pixels are built essentially from four random bytes. Given the randomness there is no sense in trying to use scanline filters, so we just have the no-op one, and compression, whilst built-in as per spec, has little impact.

As PNG is designed to support streaming - by breaking down the image data into 1 - N IDAT chunks - it seems reasonable to build up a PNG generation pipeline in a streaming manner.

Given a request for a PNG of width x height dimensions, we proceed as follows.

  • construct a ByteStreamSource to emit ByteString instances of width * 4 length. These correspond to a single line in the PNG.
  • map these into a PNG scanline by prepending a filter byte
  • pass the scanlines into the IdatStage. This is initialised to compute the "ideal" number of scanlines to collect before emitting an IDAT chunk. Ideal is 32KB / scanline length and discard any fractional part - or 1. I vaguely assume this to be a reasonable size because of the zlib window.
  • the IdatStage consumes scanlines until the target number has been collected, or height lines have been consumed. IDAT chunks are emitted. If height lines have been consumed the stage is closed.
  • IdatStage is wired up to PngStage which emits the PNG Signature, the IHDR chunk, then all the IDAT chunks and closes the format output with the IEND chunk once all IDAT chunks have been passed on.

The PngSourceFactory provides functions to construct a Source[ByteString] according to the above spec. The HTTP API layer simply does some basic validation of the input parameters and maps the source returned by the factory to the HTTP Response Entity with the appropriate Content-Type header.

Load Testing

A Gatling scenario is provided in the gatling directory which can be tweaked as needed.