tiger-data-app

TigerData is a comprehensive set of data storage and management tools and services that provides storage capacity, reliability, functionality, and performance to meet the needs of a rapidly changing research landscape and to enable new opportunities for leveraging the power of institutional data.

This application provides a front end for users to create and manage projects that live in the TigerData infrastructure.

Documentation

Auto-built code documentation is avalable at https://pulibrary.github.io/tiger-data-app/}
Design documents and meeting notes are in Google Drive
RDSS internal notes are in a separate directory
A set of requirements derived from early sketches is here.
We're writing a "Missing Manual" for the subset of Mediaflux that is used by TigerData.

Structure

These are our initial plans: In the eventual implementation different systems (Mediaflux, PostgreSQL, LDAP) may have responsibility for different bits of data. Cardinality constraints (projects must have sponsors, etc.) will be enforced in software.

erDiagram
  Project }|--|{ User : "via metadata_json"
  User ||--o{ UsersRoles : ""
  Role ||--o{ UsersRoles :  ""

  Project {
    integer created_by_user_id
    integer mediaflux_id
    jsonb metadata_json "hash with metadata"
    datetime created_at
    datetime updated_at
  }

  Role {
    string name
    string resource "default nil role applies to all Objects"
  }

  UsersRoles {
    int user FK
    int role FK
  }

  User {
    string uid "Princeton netid"
    string provider "should always be cas"
    string email
    string given_name
    string family_name
    string display_name

  }

Roles

The system will eventually have many roles. Please refer to the docs for a description of the system roles

Local development

Setup

Check out code and cd
Install tool dependencies; If you've worked on other PUL projects they will already be installed.
1. Lando
2. asdf
3. postgresql (brew install postgresql: PostgreSQL runs inside a Docker container, managed by Lando, but the pg gem still needs a local PostgreSQL library to install successfully.)
Install asdf dependencies with asdf
1. asdf plugin add ruby
2. asdf plugin add node
3. asdf plugin add yarn
4. asdf plugin add java
5. asdf install
6. ... but because asdf is not a dependency manager, if there are errors, you may need to install other dependencies. For example: brew install gpg
OR - Install dependencies with brew and chruby
1. ruby-install 3.2.3 -- --with-openssl-dir=$(brew --prefix openssl@1.1)
2. If you get "error: use of undeclared identifier 'RUBY_FUNCTION_NAME_STRING'" while updating, make sure your Xcode toolks are up to date.
3. close the terminal window and open a new terminal
4. chruby 3.2.3
5. ruby --version
Install language-specific dependencies
1. bundle install
2. yarn install

On a Mac with an M1 chip, bundle install may fail. This suggestion helped:

gem install eventmachine -v '1.2.7' -- --with-openssl-dir=$(brew --prefix libressl)
brew install pkg-config
bundle install

Starting / stopping services

We use lando to run services required for both test and development environments.

Start and initialize database services with:

bundle exec rake servers:start

To stop database services:

bundle exec rake servers:stop or lando stop

You will also want to run the vite development server:

bin/vite dev

Populate the authorized users table

Authentication and authorization is restricted to a few selected users. Make sure to run the rake task to pre-populate your local database:

bundle exec rake load_users:from_registration_list

If your name is not on the registration list see steps below under "User Registration List" for instructions on how to add yourself.

MediaFlux Server

Documentation for starting the mediaflux server can be found at doc/local_development

Once mediaflux is running locally
1. bundle exec rake schema:create

Authentication

By default, there exists for the MediaFlux deployment a user account with the following credentials:

domain: system
user: manager
password: change_me

Alternatively, one may please use docker/bin/shell to create a terminal session within the container and find individual accounts within the file /setup/config/users.json.

aterm Client

The MediaFlux aterm may be accessed using http://0.0.0.0:8888/aterm/

Desktop Client

The MediaFlux desktop client may be accessed using http://0.0.0.0:8888/desktop/

Thick Client

One may start and access the Thick Client using the Java Virtual Machine with the following steps:

$ docker/bin/start
# Within another terminal session, please invoke:
$ docker cp mediaflux:/usr/local/mediaflux/bin/aterm.jar ~/aterm.jar
$ java -Xmx4g -Djava.net.preferIPv4Stack=true -jar ~/aterm.jar

Configuration Commands

> server.identity.set :name carolyn
> display font-size 18
> display prompt   "carolyn > "
> display save

Service Documentation

The MediaFlux service documentation may be accessed using http://0.0.0.0.:8888/mflux/service-docs/

Running tests

Fast: bundle exec rspec spec
Run in browser: RUN_IN_BROWSER=true bundle exec rspec spec

Starting the development server

bundle exec rails s -p 3000
Access application at http://localhost:3000/

Production and Staging Deployment

Deploy with Capistrano (we are intending to have a deployment mechanism with Ansible Tower, but that is not yet implemented) bundle exec cap production deploy or bundle exec cap staging deploy

Mail

Mail on Development

Mailcatcher is a gem that can also be installed locally. See the mailcatcher documentation for how to run it on your machine.

Mail on Staging and QA

To See mail that has been sent on the Staging and QA servers you can utilize capistrano to open up both mailcatcher consoles in your browser (see below). Look in your default browser for the consoles

staging command

cap staging  mailcatcher:console

qa command

cap qa  mailcatcher:console

Mail on Production

Emails on production are sent via Pony Express.

User Registration List

For local development, add yourself as a SuperUser to the TigerData preliminary registration list and follow these instructions:

Updating User Registration List

To save updates and make changes to appointed users for early testing of the TigerData site:

Make the requested changes to the Google spreadsheet
Save those updated changes
Download the file as a .CSV file
Copy the downloaded .CSV file to data > user_registration_list.csv
Open a PR to check the updated file into version control
Once that PR is merged, release and deploy the code. This will automatically run the load_users.rake rake task.

Sidekiq

Sidekiq is used to run backgroud jobs on the server. The jobs are created by ActiveJob and ActiveMailer.

Workers

Workers must be running on each server in order for mail to be sent and background jobs to be run. The sidekiq workers are run on the server via a service, tiger-data-workers. To see the status on the workers on the server run sudo service tiger-data-workers status. You can restart the workers by running sudo service tiger-data-workers restart.

pulibrary / tiger-data-app

readme