Skip to content

Latest commit

 

History

History
230 lines (173 loc) · 7.85 KB

INSTALL.md

File metadata and controls

230 lines (173 loc) · 7.85 KB

Installing

Hatrac (pronounced "hat rack") is a simple object storage service for web-based, data-oriented collaboration.

Install code

  1. Check out the development code from GitHub.
  2. Install prerequisites:
  • Apache HTTPD
  • PostgreSQL
  • Python
  • flask
  • webauthn2
  1. Install hatrac Python package from top-level of development code.
  • python setup.py install
  1. Run basic tests or continue to configure the web stack.

Basic testing

You can perform some local testing of Hatrac without configuring the whole web service stack, daemon account, nor daemon account configuration data:

# make sure Hatrac is installed
% python setup.py install

# get rid of any previous test data
% dropdb hatrac_test
% rm -rf hatrac_test_data

# create empty test database
% createdb hatrac_test

# run tests
% python test/smoketest.py

The test script should run to completion without printing anything. If it encounters errors, diagnostics will be printed and the script will exit. You MUST start with an empty test database and empty test data directory prior to each run of the test.

Configure web stack

These steps continue following the basic code installation. For installations using SE-Linux, please see working with SE-Linux below.

  1. Create hatrac daemon account.
  • E.g. useradd --create-home --system hatrac (as root)
  1. Create hatrac PostgreSQL role.
  • E.g. createuser -d hatrac (as PostgreSQL superuser)
  1. Configure ~hatrac/hatrac_config.json
  1. Create and initialize hatrac database.
  • E.g. createdb hatrac (as hatrac user)
  • E.g. hatrac-deploy your_username_or_group (as hatrac user)
  1. Create file storage directory under Apache
  • E.g. mkdir /var/www/hatrac && chown hatrac /var/www/hatrac (as root)
  1. Configure ~hatrac/webauthn2_config.json
  1. Configure mod_wsgi to run Hatrac

The hatrac-deploy step above depends on having a proper ~hatrac/hatrac_config.json file already populated and an empty database already created. It simply initializes the database schema.

If working in a SELinux environment see section below to SELinux specific additional steps needed before running the tests

Example Hatrac configuration

This configuration works on a Fedora and Ubuntu host for a filesystem deployment:

{
    "storage_backend": "filesystem",
    "storage_path": "/var/www/hatrac",
    "database_type": "postgres",
    "database_dsn": "dbname=hatrac",
    "database_schema": "hatrac",
    "database_max_retries": 5,
	"403_html": "<html><body><h1>Access Forbidden</h1><p>%(message)s</p></body></html>",
	"401_html": "<html><body><h1>Authentication Required</h1><p>%(message)s</p></body></html>"
}

This configuration works for an Amazon AWS S3 deployment:

{
    "backend": "amazons3",
    "s3_bucket" : <S3_BUCKET_NAME>,
    "s3_connection": {
    "aws_access_key_id" : <AWS_ACCESS_KEY_ID>,
	    "aws_secret_access_key": <AWS_SECRET_ACCESS_KEY>
    },
    "database_type": "postgres",
    "database_dsb": "dbname=hatrac",
    "database_schema": "hatrac",
    "database_max_retries": 5
}

This configuration block enables an optional static firewall to require clients to have certain attributes in order to perform certain operations:

{
    ...
    "firewall_acls": {
        "create": ["object-or-ns-uploader-group"],
        "delete": ["object-or-ns-deletion-group"],
        "manage_acls": ["acl-admin-group"],
        "manage_metadata": ["metadata-curator-group"]
    }
}

For backwards compatibility, firewall_acls uses a default ACL of ["*"] when an ACL is not configured. This permissive mode then allows the normal fine-grained authorization checks to proceed for each request. A more restrictive firewall ACL can block a request that would normally be allowed due to the fine-grained ACL state in the hatrac namespace hierarchy.

REST API testing

You can perform system testing of the whole web service stack, if configured with cookie-based authentication (such as using a companion ERMrest installation as described in the subsequent configration examples of this document):

# manually create a session cookie
% curl -b cookie -c cookie \
   -d username=testuser -d password=testpassword \
   https://$(hostname)/ermrest/auth/session

# run the test script
% COOKIES=cookie bash test/rest-smoketest.sh

Example Webauthn2 configuration

This configuration allows Hatrac to share an existing Webauthn2 deployment from an ERMrest installation (if ERMrest is configured with the same cookie name and path settings):

{
      "require_client": true,
      "require_attributes": true, 
      "listusers_permit": ["admin"], 
      "listattributes_permit": ["admin"], 
      "manageusers_permit": ["admin"], 
      "manageattributes_permit": ["admin"], 
            
      "session_expiration_minutes": 30, 
      "def_passwd_len": 10, 
      "hash_passwd_reps": 1000,
        
      "sessionids_provider": "webcookie", 
      "sessionstates_provider": "database", 
      "clients_provider": "database", 
      "attributes_provider": "database", 
        
      "handler_uri_usersession": "/ermrest/authn/session", 
        
      "web_cookie_name": "ermrest", 
      "web_cookie_path": "/", 
      "web_cookie_secure": true, 
      "setheader": false,

      "database_schema": "webauthn2", 
      "database_type": "postgres", 
      "database_dsn": "dbname=ermrest", 
      "database_max_retries": 5
}

You will also have to grant ermrest permissions to the hatrac role: GRANT ermrest TO hatrac ; (as postgres user)

Note: at present, Hatrac does not expose any Webauthn2 REST APIs so you MUST share an existing deployment as above if you want to use a local account and session-based login for testing. Additionally, you must then grant the ermrest role to the hatrac role in PostgreSQL so that Hatrac can look up client authentication information at runtime.

Example Apache configuration

See the sample wsgi_hatrac.conf file in the git repo, which would be installed under /etc/httpd/conf.d/ on Red Hat flavored machines:

AllowEncodedSlashes On

WSGIPythonOptimize 1
WSGIDaemonProcess hatrac processes=4 threads=4 user=hatrac maximum-requests=2000
WSGIScriptAlias /hatrac /usr/lib/python2.7/site-packages/hatrac/hatrac.wsgi
WSGIPassAuthorization On

WSGISocketPrefix /var/run/wsgi/wsgi

<Location /hatrac>

   AuthType webauthn
   Require webauthn-optional

   WSGIProcessGroup hatrac
    
</Location>

Working with SE-Linux

The following is an example set of commands to allow Hatrac to write to the filesystem in a Fedora installation. On other distributions, the appropriate path and SE-Linux contexts might vary slightly:

setsebool -P httpd_can_network_connect_db on
semanage fcontext --add --type httpd_sys_rw_content_t "/var/www/hatrac(/.*)?"
restorecon -rv /var/www/hatrac
semanage fcontext --add --type httpd_sys_content_t "/home/hatrac(/.*)?"
restorecon -rv /home/hatrac

Errors regarding WSGI socket

On some versions of Fedora and CentOS, you may encounter errors in the Apache SSL server log similar to Unable to connect to WSGI daemon process on '/var/run/wsgi/wsgi.1331.1.1.sock'.

A workaround is to configure the SE-Linux context:

semanage fcontext --add --type httpd_var_run_t "/var/run/wsgi"
restorecon -rv /var/run/wsgi