Use the navigation options or select one of the guides below.
All users should have received their login credentials in an email. If you have not, please email the Virtual Research Support Core at scinet_vrsc@USDA.GOV.
If you have not received a LincPass or YubiKey, please see the Deprecated Login Procedures page for instructions on accessing the HPC.
If you prefer a graphical user interface, these are some of the options you have to access SCINet.
The SCINet VPN will no longer be offered as a service.
If you have been using the SCINet VPN, please explore Open OnDemand (OOD) for your use cases. OOD provides full access to a Linux desktop, all SCINet apps, and cluster controls.
If you believe the SCINet VPN is essential to your work, please contact firstname.lastname@example.org as soon as possible so we can help find a solution and your critical work is not delayed.
Note that this change applies only to the SCINet VPN; the USDA/ARS VPN is not affected.
The software discussed and shown in these user guides is largely open source, can run on a desktop, HPC, or cloud environment, and can be installed with software management systems that support reproducibility (such as Conda, Singularity, and Docker). Below is a quick overview of some of the software, hardware, and confusing nomenclature that is used throughout this site.
This guide lists differences between the Atlas and Ceres clusters to ease transition from one cluster to another.
Add the following sentence as an acknowledgment for using Ceres as a resource in your manuscripts meant for publication:
“This research used resources provided by the SCINet project of the USDA Agricultural Research Service, ARS project number 0500-00093-001-00-D.”
In addition to the Ceres and Atlas clusters, there are external computing resources available to the SCINet community, including Amazon Web Services, XSEDE, and the Open Science Grid. These resources may be of interest to SCINet users who require:
- very large jobs (either numerous small jobs, or many nodes in parallel)
- special computing hardware requirements (e.g., GPUs, Xeon Phi, extremely-large memory)
- software that isn’t supported on Ceres (e.g., web apps, relational databases, VMs, Hadoop, Spark, certain commercial software)
This document describes recommended procedures (SOP) for managing data on ARS HPC and storage infrastructure.
Each file on a Linux system is associated with one user and one group. On Ceres, files in a user’s home directory by default are associated with the user’s primary group, which has the same name as user’s SCINet account. Files in the project directories by default are associated with the project groups. Group quotas that control the amount of data stored are enabled on both home and project directories.
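Since quotas are enforced per group, it helps to know which group a given file is charged to. The commands below are a minimal sketch of checking and changing a file's group; the project group name proj_example is a placeholder, not an actual SCINet group.

```shell
# A newly created file inherits the user's current (primary) group;
# on Ceres this matches the SCINet account name.
touch demo_file.txt
stat -c %G demo_file.txt   # group that owns the new file
id -gn                     # the user's primary group -- the two match

# To have files count against a project group's quota instead, either
# change the group of an existing file:
#   chgrp proj_example demo_file.txt   # "proj_example" is a placeholder
# or start a shell whose default group is the project group, so new
# files are created with it:
#   newgrp proj_example
```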
At login, current usage and quotas are displayed for all groups that a user belongs to. The my_quotas command provides the same output.
This document provides detailed information about the storage options provided by SCINet and how to use them. For a simpler overview of suggested procedures for managing data on SCINet, please see Managing Data on ARS HPC and Storage Infrastructure.
There are multiple places to store data on the Ceres and Atlas clusters that all serve different purposes.
Data Transfer best practices.
Globus Online is the recommended method for transferring data to and from the HPC clusters.
Users run their applications on the cluster in either interactive mode or batch mode. Interactive mode (the srun command) is familiar to anyone using the command line: the user specifies an application by name and various arguments, hits Enter, and the application runs. However, in interactive mode on a cluster the user is automatically switched from a login node to a compute node. This keeps intense computation off the login nodes, so that the login nodes retain the resources necessary for managing the cluster. Never run applications directly on the login nodes; use interactive mode (or batch mode) instead.
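The interactive workflow above can be sketched with srun as follows; the partition name, time limit, and memory request are placeholders, so check sinfo for the partitions actually available on your cluster.

```shell
# Request an interactive shell on a compute node (a sketch; "short",
# the time limit, and the memory request are example values only).
srun --pty -p short -t 01:00:00 -n 1 --mem=8G bash -l

# Once the new prompt appears you are on a compute node and can run
# your application normally; "exit" ends the session and releases
# the node back to the scheduler.
```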
Compute jobs are run on functional groups of nodes called partitions or queues. Each different partition has different capabilities (e.g. regular memory versus high memory nodes) and resource restrictions (e.g. time limits on jobs). Nodes may appear in several partitions.
To provide better Ceres usage reporting, all Ceres users have been assigned Slurm accounts based on their project groups. If you don't have a project, your default and only Slurm account is sandbox.
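To see which Slurm accounts you can submit under, and to submit under a specific one, something like the following sketch applies; the account name proj_example is a placeholder, and output formatting varies by site.

```shell
# List the Slurm accounts associated with your user
# (parsable output, one account per line):
sacctmgr -nP show associations user=$USER format=account

# Submit a job under a specific account rather than your default:
sbatch --account=proj_example myjob.sh   # "proj_example" is a placeholder
```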
All compute nodes have 1.5 TB of fast local temporary data file storage space supported by SSDs. This local scratch space is significantly faster and supports more input/output operations per second (IOPS) than the mounted filesystems on which the home and project directories reside.
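A common pattern for exploiting the fast local scratch space is to stage data onto it at the start of a job and copy results back at the end. The sketch below assumes the scheduler exposes the node-local space via $TMPDIR and uses placeholder paths and a hypothetical application name (my_app); check your cluster's documentation for the actual scratch location.

```shell
#!/bin/bash
#SBATCH --job-name=scratch-demo
#SBATCH --ntasks=1
#SBATCH --time=01:00:00

# Stage input onto the node-local SSD scratch space ($TMPDIR is an
# assumption; paths and "my_app" are placeholders).
cp /project/proj_example/input.dat "$TMPDIR"/
cd "$TMPDIR"

# I/O-heavy work now runs against the fast local SSD.
my_app input.dat > output.dat

# Copy results back to the project directory before the job ends,
# since local scratch is cleaned up when the job finishes.
cp output.dat /project/proj_example/
```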
This guide includes information about command-line software, as well as information on graphical software such as Galaxy, CLC, Geneious, RStudio, and Jupyter.
Conda is a software package manager for data science that allows unprivileged (non-administrative) Linux or MacOS users to search, fetch, install, upgrade, use, and manage supported open-source software packages and programming languages/libraries/environments (primarily Python and R, but also others such as Perl, Java, and Julia) in a directory they have write access to. Conda allows SCINet users to create reproducible scientific software environments (including outside of Ceres) without requiring the submission of a SCINet software request form for new software, or contacting the VRSC to upgrade existing software.
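A typical Conda workflow looks like the sketch below. The module name, environment path, and package versions are illustrative assumptions, not guaranteed to match what is installed on Ceres.

```shell
# Load Conda (the module name is an assumption; check "module avail").
module load miniconda

# Create a reproducible environment in a project directory so
# collaborators can share it ("proj_example" is a placeholder).
conda create --prefix /project/proj_example/env_blast \
    -c bioconda -c conda-forge blast -y

# Activate the environment by its path and use the software.
conda activate /project/proj_example/env_blast
blastn -version
```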
Open OnDemand is an intuitive, innovative, and interactive interface to remote computing resources. The key benefit for SCINet users is that they can use any web browser, including browsers on a mobile phone, to access Ceres.
There are several interactive apps that can be run in Open OnDemand including Jupyter, RStudio Server, Geneious, CLC Genomics Workbench, and more. The desktop app allows a user to run any GUI software.
The Environment Modules package provides dynamic modification of your shell environment. This also allows a single system to accommodate multiple versions of the same software application and for the user to select the version they want to use. Module commands set, change, or delete environment variables, typically in support of a particular application.
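The most common module commands are sketched below; the package name samtools and the version number are examples, and the versions actually installed may differ.

```shell
module avail               # list software available through modules
module load samtools       # load the default version of a package
module load samtools/1.17  # or request a specific version (example)
module list                # show currently loaded modules
module unload samtools     # remove a package from your environment
module purge               # reset to a clean environment
```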
Some software packages may not be available for the version of Linux running on the HPC cluster. In this case, users may want to run containers. Containers are self-contained application execution environments that contain all necessary software to run an application or workflow, so users don’t need to worry about installing all the dependencies. There are many pre-built container images for scientific applications available for download and use.
Singularity (https://sylabs.io/) is an application for running containers on an HPC cluster. Containers are self-contained application execution environments that include all the software needed to run an application or workflow, so you don't need to worry about installing dependencies. Many pre-built container images for scientific applications are available for download and use; see the Container Images section.
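Pulling and running a pre-built image typically looks like this; the image URI is a generic example from Docker Hub, not a SCINet-specific image.

```shell
# Pull a pre-built image; this converts it to a local .sif file.
singularity pull docker://ubuntu:22.04        # creates ubuntu_22.04.sif

# Run a single command inside the container.
singularity exec ubuntu_22.04.sif cat /etc/os-release

# Or open an interactive shell inside the container.
singularity shell ubuntu_22.04.sif
```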
This document assumes that a licensed copy of CLC Genomics Workbench 22 is installed locally and available to the user.
Jupyter is an Integrated Development Environment (IDE) that provides an interactive and collaborative environment for scientific computing. This interactive coding environment allows for immediate execution and visualization of code, facilitating on-the-fly data analysis and visualization. It supports over 40 programming languages (including Python, R, Julia, Java, and Scala) and seamlessly integrates with popular data science libraries.
The popular R, Perl, and Python languages have many packages/modules available. Some of these packages are installed on Ceres and are available with the r, perl, python_2, and python_3 modules. To see the list of installed packages, visit the Preinstalled Software List page or use the module help <module_name> command. If users need packages that are not available, they can either request that the VRSC add them, or download and install the packages in their home or project directories. We recommend installing packages in project directories, since collaborators on the same project will likely need the same packages; in addition, home directory quotas are much lower than project directory quotas.
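Installing Python packages into a shared project directory can be sketched as follows; the paths and package name are placeholders, and the module name is an assumption.

```shell
# Load the system Python (module name is an assumption).
module load python_3

# Install a package into a shared project location instead of $HOME
# ("proj_example" and "biopython" are placeholders).
pip install --target=/project/proj_example/pylibs biopython

# Make the shared location visible to Python, e.g. in job scripts
# or in your shell startup file.
export PYTHONPATH=/project/proj_example/pylibs:$PYTHONPATH
```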
FAQ about AWS resources available to ARS scientists.
The aws-saml tool can be used in conjunction with the Shibboleth SAML identity provider to retrieve time-limited API keys suitable for command-line use. It interactively prompts you for your password and, if you have multiple roles available and none is specified on the command line, prompts you to choose one. The resulting credentials are normally stored in your standard AWS credentials file, but a command-line flag can be provided to have the credentials written to standard output in Bash format for scripting. These credentials normally expire after one hour; by providing the refresh flag, the tool will fork into the background and keep the credentials refreshed for as long as your login cookie remains valid.