Ingesting Data
Requirements
Install the ingest package
pip install -e path-to-repo/ingest
Preparing the data
The data is expected to be available in bp files format with a specific file structure before it can be ingested properly. This structure and the file expectations are covered in detail in the ingest schema docs. It is recommended that you read those first to set everything up properly.
Ingest
Once the data has been prepared the script can be run:
esimmon watch -f <root-folder-id> -r <url-to-data> -k <girder-api-key> -u <girder-api-url> -v 5 -a <fastapi-url>
-
root-folder-id: This will be the id of the root folder that was created when you followed the Ansible steps to create the Girder instance.- Navigate to http://localhost:8080 (or wherever your Girder instance is currently being hosted)
- Select
Collectionsfrom the left-hand panel - Select
eSimMon>eSimMon-dashboard - Click the blue
iicon in the upper right corner
-
url-to-data: This will point to where the data is. This can be either a URL or a local file path. If it is a local file path it should be prefaced withfile:. -
girder-api-key: This will vary from user to user.- Navigate to your Girder instance
- Select
My Accountfrom the drop-down menu in the upper right hand corner - Navigate to the
API keystab. An API key calledtest_keyshould have been automatically generated on creation. i. If this key is not available selectCreate new key. This only needs to be done once. ii. Enter a name for the key and leave the token duration field empty and leave theAllow all actions on behalf of my useroption selected, the pressCreate. - Select
showunder the key column and then copy the key that is displayed. Keep this information safe and private.
-
girder-api-url: This will be the url of your Girder instance with the prefix/api/v1(http://localhost:8080/api/v1if running locally). -
fastapi-url: This will be the url of your FastAPI instance with the prefix/api/v1(http://localhost:5000/api/v1if running locally). This key is needed to automatically generate and save the default movies for the ingested data once the run is complete.