golem-architecture

Abstract

This GAP introduces a manifest file specification and a set of GCRS (Golem Computation Resource Standards) properties (herein after referred to as “Computation Manifest”), which allow Requestors to declare the constraints of computations they may execute on a Provider’s machine.

Motivation

The need for a Computation Manifest originates from the upcoming Internet access feature, where each outbound request is routed via Provider’s host operating system. In order to prevent unwanted / illegal behavior on Provider’s behalf, they need to remain in control of actions that Requestors can execute on their machine.

With Computation Manifests, Requestors constrain themselves to a certain set of allowed actions, to be negotiated with and approved by a Provider. Requestors’ actions will be verified against the Manifest during computation.

Specification

Computation Manifest can take one of the following forms:

The Computation Manifest is expected to be included in a Computation Payload Manifest (GAP) file, where its hash is co-signed with Payload hashes. If a Computation Manifest is included in the Demand (and thus, Agreement), its role is to derive the properties from and override the initially sourced Manifest.

GCRS golem.srv.comp.manifest namespace

Intended for overriding an external Manifest but allows defining a new Manifest from the ground up using properties only.

  1. golem.srv.comp.manifest.version : String

    Specifies a version (Semantic Versioning 2.0 specification) of the manifest, defaults to “0.1.0”

  2. golem.srv.comp.manifest.script (sub-namespace)

    Defines a set of allowed ExeScript commands and applies constraints to their arguments.

  3. golem.srv.comp.manifest.script.commands : List[String]

    Specifies a curated list of commands in form of:

    • UTF-8 encoded strings

      No command context or matching mode need to be specified.

      E.g. ["run /bin/cat /etc/motd", "run /bin/date -R"]

    • UTF-8 encoded JSON strings

      Command context (e.g. env) or argument matching mode need to be specified for a command.

      E.g. ["{\"run\": { \"args\": \"/bin/date -R\", \"env\": { \"MYVAR\": \"42\", \"match\": \"strict\" }}}"]

    • mix of both

    "deploy", "start" and "terminate' commands are always allowed. These values become the default if no manifest.script.command property has been set in the Demand, but the manifest namespace is present.

  4. golem.srv.comp.manifest.script.match : String

    Selects a default way of comparing command arguments stated in the manifest and the ones received in the ExeScript, unless stated otherwise in a command JSON object.

    The match property could be one of:

    • "strict": byte-to-byte argument equality (default)
    • "regex": treat arguments as regular expressions

      Syntax: Perl-compatible regular expressions (UTF-8 Unicode mode), w/o the support for look around and backreferences (among others); for more information read the documentation of the Rust regex crate.

  5. golem.srv.comp.manifest.net (sub-namespace)

    Applies constraints to networking. Currently, outgoing requests to the public Internet network are covered.

  6. golem.srv.comp.manifest.net.inet.out.protocols : List[String]

    List of allowed outbound protocols. Currently fixed at ["http", "https"].

  7. golem.srv.comp.manifest.net.inet.out.urls : List[String]

    List of allowed external domains that outbound requests can be sent to.
    E.g. ["http://golemfactory.s3.amazonaws.com", "https://github.com", "tcp://5.6.7.8:33/", "udp://1.1.1.1:853/"]

    Port number is inferred from protocol in domain name and doesn’t have to be explicit. Access to non-standard port should be always specified.

    Note: There is difference between “https://github.com” and “http://github.com” and they should be separate entries.

    If unrestricted outbound access is requested this property must not be set.

  8. golem.srv.comp.manifest.net.inet.out.unrestricted.urls : true

    This property means that the payload requires unrestricted outbound access. When present the value is always true. Either this property or the URL list in golem.srv.comp.manifest.net.inet.out.urls must be present.

    The manifest must be considered invalid and outbound access should not be permitted in the following scenarios:

    • neither golem.srv.comp.manifest.net.inet.out.unrestricted.urls nor golem.srv.comp.manifest.net.inet.out.urls is present
    • both golem.srv.comp.manifest.net.inet.out.unrestricted.urls and golem.srv.comp.manifest.net.inet.out.urls are present
    • property golem.srv.comp.manifest.net.inet.out.unrestricted.urls is present and it contains a value different from true

Example

{
  "golem.srv.comp.manifest.script.match": "regex",
  "golem.srv.comp.manifest.script.commands": [
    "run /bin/cat /etc/motd",
    "{\"run\": { \"args\": \"/bin/date -R\", \"env\": { \"MYVAR\": \"42\", \"match\": \"strict\" }}}"
  ],
  "golem.srv.comp.manifest.net.inet.out.protocols": [
    "http",
    "https"
  ],
  "golem.srv.comp.manifest.net.inet.out.urls": [
    "http://golemfactory.s3.amazonaws.com/file1",
    "http://golemfactory.s3.amazonaws.com/file2"
  ]
}

Object representation

Example

  1. JSON

     {
       "script": {
         "match": "regex",
         "commands": [
           "run /bin/cat /etc/motd",
           {
             "run": {
               "args": "/bin/date -R",
               "env": {
                 "MYVAR": "42"
               },
               "match": "strict"
             }
           }
         ]
       },
       "net": {
         "inet": {
           "out": {
             "protocols": [
               "http",
               "https"
             ],
             "urls": [
               "http://golemfactory.s3.amazonaws.com/file1",
               "http://golemfactory.s3.amazonaws.com/file2"
             ]
           }
         }
       }
     }
    
  2. Imploded JSON (optional)

     {
       "script.match": "regex",
       "script.commands": [
         "run /bin/cat /etc/motd",
         {
           "run": {
             "args": "/bin/date -R",
             "env": {
               "MYVAR": "42"
             },
             "match": "strict"
           }
         }
       ],
       "net.inet.out.protocols": [
         "http",
         "https"
       ],
       "net.inet.out.urls": [
         "http://golemfactory.s3.amazonaws.com/file1",
         "http://golemfactory.s3.amazonaws.com/file2"
       ]
     }
    
  3. YAML

     ---
     script:
       match: regex
       commands:
       - run /bin/cat /etc/motd
       - run:
           args: "/bin/date -R"
           env:
             MYVAR: '42'
           match: strict
     net:
       inet:
         out:
           protocols:
           - http
           - https
           urls:
           - http://golemfactory.s3.amazonaws.com/file1
           - http://golemfactory.s3.amazonaws.com/file2
    

Rationale

External Manifest sourcing

The main purpose of external sourcing is to provide a trusted set of actions approved by the signing authority (e.g. Golem Factory). The prevalent case of sourcing would be to download the Manifest included within a Computation Payload. This way, in most common cases, Requestors won’t need to provide a custom Computation Manifest.

Usefulness of the new property set

Because some Providers may be equipped with a custom set of filters, they may e.g. whitelist a broader range of domain names. This way, the Manifests crafted by Requestors or the ones overriding a trusted Manifest via properties may still be accepted by Providers.

Lack of signatures

The manifest is meant to be co-signed with the payload by a trusted authority (i.e. bundled) or signed by a Requestor as a part of the Agreement.

Backwards Compatibility

Certain Runtime capabilities (e.g. Internet access) will require presence of the Computation Manifest. Requestors expecting the new capability will need to include a Computation Manifest in the Demand, otherwise it may be discarded by Providers (unless it’s included in the Computation Payload).

Computation Manifests are not required by the legacy Runtime capabilities and negotiations will commence as prior to introducing this feature.

In some versions of the reference implementation yagna when neither golem.srv.comp.manifest.net.inet.out.unrestricted nor golem.srv.comp.manifest.net.inet.out.urls is present, but the payload is signed (as described in Computation Payload Manifest) the provider can allow unrestricted access to the requestor. This is not intended and will not work in future releases. The intention is that unrestricted access must be explicitly requested.

Test Cases

N/A

[Optional] Reference Implementation

N/A

Security Considerations

Copyright and related rights waived via CC0.