HPC/FAQ: Difference between revisions

From CNM Wiki
 
(422 intermediate revisions by the same user not shown)
Line 1: Line 1:
== Access and proposals ==
== Access and proposals ==
CNM's HPC ''Carbon'' facility may be accessed by one of two routes:
The HPC ''Carbon'' cluster and its associated software licenses may be accessed,
either under a proposal within the [http://www.anl.gov/cnm/user-information '''CNM User Program''']
like any tool at the CNM, by one or both of the following routes:
or by being an employee of the CNM's hosting institution, [http://www.anl.gov/cnm/ Argonne's NST division].
* CNM Facility Users are eligible under [https://www.anl.gov/cnm/become-a-cnm-user '''CNM's User Access Program''']. Follow the link to learn how to initiate access under this route. Begin by registering as a Facility User, then submit a proposal.
* Affiliates of [http://www.anl.gov/cnm Argonne's NST division] are eligible for discretionary access. Affiliate here refers to regular employees, Argonne/University joint appointees, visitor appointees, and student group members, being supervised by CNM staff.


Your starting point to access ''Carbon'' under the user program is to [http://www.anl.gov/cnm/user-information/becoming-user '''become a CNM user'''].
The following is a summary of the main points as they pertain to ''Carbon''.
The following is a summary of the main points as they pertain to ''Carbon''.
=== Legal User Agreement ===
The institution where you work or study must have a legal signed
[https://www.anl.gov/cnm/prepare-for-your-arrival '''User Facility Agreement''']
with Argonne in place before any work under an eventual proposal may be performed.
[https://aps.anl.gov/Users-Information/Legal-Financial/Argonne-User-Facility-Agreements '''Look up existing User Facility Agreements'''] and locate your institution if you have never worked with Argonne before, or you recently changed your affiliation, or are going to change it soon.
If your institution is not shown, ''get started'' with establishing a user agreement as soon as possible. Legal wheels turn slowly and your scientific work could be delayed.
=== User Registration ===
Under CNM's User Access Program, your starting point is to [https://www.anl.gov/cnm/how-to-register-to-become-a-cnm-user '''become a CNM user'''].
=== Proposal Lifecycle Overview ===
<!-- http://www.anl.gov/cnm/user-information/call-proposals -->
# To '''submit a proposal''', follow the instructions at the CNM's [https://www.anl.gov/cnm/cnm-user-information '''Call for Proposals'''].
# Upon successful review of a proposal and grant of an allocation, '''the contacting author''' will receive an email with subject '''User Proposal Status Notification'''.
# '''The contacting author must submit,''' for each proposal, a Safety&Data Form as instructed in the status email. This will confirm the names and badge numbers of participating users, as well as various characteristics of the proposal that we need for statistical reporting to our funding agencies.
# Participating '''users register and respond''' at various steps to have their Argonne user accounts created, activated or reactivated as needed, and have them authorized to access Carbon.
# Participating users '''perform the work proposed'''.
# [https://www.anl.gov/cnm/cnm-acknowledgment-statements-for-publications '''Credit CNM in publications'''] that result from your proposal work with us.
# After the proposal expires, our User Office will ask you to please submit:
#* An approximately 1-page '''Activity Report''' summarizing work performed and perhaps already published under the proposal. We use these reports to document the use of our facilities to funding agencies.
#* An end-of-proposal '''Satisfaction Survey''', to invite your feedback on our process.
== Account Types ==
You access our computer systems by two different user accounts:
=== User Registration Account ===
: – Looks like: <code>123456</code> –<br>
Your ''User Registration Account:''
* … is created when you register as a CNM facility user, whether as a participant or as Principal Investigator (PI), at the [https://beam.aps.anl.gov/pls/apsweb/ufr_main_pkg.usr_start_page '''User Registration'''] for the Advanced Photon Source (APS) and CNM.
* … has an all-numeric user name (no letters, no leading zeroes), which never changes – this is in fact your ''Argonne Badge Number'',
* … has its own password and [https://beam.aps.anl.gov/pls/apsweb/forgot_password.start_process '''password reset process at APS'''],
Use your User Registration Account to:
* … [https://beam.aps.anl.gov/pls/apsweb/ufr_main_pkg.usr_start_page '''update your user registration'''], such as when you change your affiliation, or to update visa information if Argonne needs it for you,
* … [https://pico.cnm.anl.gov/proposals/people/login '''submit a user proposal to CNM''']
* … [https://beam.aps.anl.gov/pls/apsweb/rt0004.intro_process '''take remote training'''] prior to beginning work, and prior to being assigned an Argonne domain account, which is described next.
=== Argonne Domain Account ===
: – Looks like: <code>'''b'''''123456''</code> or <code>''jsmith''</code> –<br>
Your ''Argonne Domain Account:''
* … is created '''when you first need it''', such as when you are a PI and have received the go-ahead on your first proposal with us,
* … has, at first, a user name beginning with <code>'''b'''</code>, followed by your badge number (without any leading zeroes),
* … will get, following your request, a personalized user name, such as <code>''jsmith''</code>
* … has its own password and uses its own [https://mypassword.anl.gov/ '''password reset and recovery'''].  <!-- Redirects to https://servicenow.anl.gov/pr -->
Use your Argonne Domain Account to:
* … submit the '''Safety&Data Form (UWS)''' of each proposal for which you are the PI or delegated spokesperson,
* … perform work – always remotely – at [[HPC|CNM's '''High Performance Computing system ''Carbon''''']], <!-- , where you will typically run Linux terminal commands. This account's user name never begins with a number. -->
* … handle almost all other tasks at Argonne.
For assistance, please contact our [mailto:[email protected]?subject=Account%20Question CNM User Office]. – Specify ''which task'' you have difficulties with.
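
The two account-name formats described in this section can be told apart mechanically. The following is a minimal Python sketch, not an official Argonne tool: the function name <code>classify_account</code> and its return labels are hypothetical, and the pattern used for personalized names (a lowercase letter followed by letters or digits) is an assumption based only on the <code>jsmith</code> example.

```python
import re


def classify_account(name: str) -> str:
    """Guess which account type a user name belongs to (illustrative only)."""
    # User Registration Account: all-numeric badge number, no leading zeroes.
    if re.fullmatch(r"[1-9][0-9]*", name):
        return "registration"
    # Initial Argonne Domain Account: "b" followed by the badge number.
    if re.fullmatch(r"b[1-9][0-9]*", name):
        return "domain-initial"
    # Personalized domain name such as "jsmith" (this pattern is an assumption).
    if re.fullmatch(r"[a-z][a-z0-9]*", name):
        return "domain-personalized"
    return "unknown"


for example in ("123456", "b123456", "jsmith"):
    print(example, "->", classify_account(example))
```

A name that begins with a digit can only be a User Registration Account, which is a quick way to check that you are using the right account for a given login page.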


== Getting started on a proposal ==
== Getting started on a proposal ==
<!-- We will notify the proposal spokesperson (usually the Principal Investigator, PI) once a proposal has been approved. At this point, … -->
<!-- We will notify the proposal spokesperson (usually the Principal Investigator, PI) once a proposal has been approved. At this point, … -->
To get a proposal started once it has been approved, '''Action is required for each proposal''' by the proposal spokesperson (usually the Principal Investigator,PI), regardless of any previous proposals with us.
To start working on a proposal once it has been granted, '''action is required for each proposal''' by the proposal spokesperson (usually the Principal Investigator,PI), regardless of any previous proposals with us.
At this stage, we collect and update information about each proposal for reporting purposes to our funding agencies, and ensure that safety and training requirements are met by all participating users.
At this stage, we collect and update information about each proposal for reporting purposes to our funding agencies, and ensure that safety and training requirements are met by all participating users.


=== To get started as spokesperson or delegate ===
=== To get started as spokesperson or delegate ===
:* Locate the email that we sent to you with subject ''User Proposal Status Notification'', and follow the instructions therein. Some steps are needed only once because they pertain to your institution. Your key action, for each proposal, will be to fill in the Safety and Data form; to do so, follow the ''User Work Submittal'' (UWS) link given in the email.
# Locate the email to you with subject ''User Proposal Status Notification'', sent by the CNM User Office.
:* Remind participating users to review their individual access requirements.
#* If you are delayed in starting your proposal, you may need to expand your mailbox search time frame, to begin the search at possibly several months before present.
:* Please be patient. '''Processing typically takes at least one business day,''' as both our User Office staff and your Scientific Contact need to attend to your submission in person.
#* You may need to inspect the "Junk" or "Spam" folder of your mail application.
# Follow the instructions in that email.
#* Certain steps are needed only once because they pertain to your institution, notably filing a User Agreement.
#* Before work on a new proposal may begin, we may ask you to submit ''Activity Reports'' that are due for previous proposals upon their conclusion.
#* Fill in the ''Safety and Data form'', i.e., follow the ''User Work Submittal'' (UWS) link that is given in our notification email.
#:* Only the spokesperson may fill in the UWS form, or the PI may [mailto:[email protected]?subject=Delegate%20UWS-PI%20submission designate a delegate]. Changes after the initial UWS submission by you (the PI or delegate) can only be made by your Scientific Contact at the CNM – see below.
#:* The UWS is '''due for each proposal''', as it collects certain metadata that we need, on a per-proposal-basis, for usage reporting to our funding agencies.
#:* Confirm or add users in section "Personnel Participating In User Proposal" of the UWS form. Enter the '''badge numbers''' for all participating users, as far as they are known at submission time.
# Direct all users who you expect to access ''Carbon'', possibly including yourself, to follow the instructions in section  '''[[#Ongoing user access requirements]]'''.
# Be patient. '''UWS Processing typically takes at least one business day,''' as both our User Office staff and your Scientific Contact need to attend to your submission in person.
# The User Office will notify the spokesperson or delegate once work on a proposal may begin, issuing what we call the ''User Work Approval (UWA)''.  It will be valid for one year for regular proposals, and 6 months for rapid-access proposals.
 
=== To get started as participating user ===
For new proposals or users new to CNM:
# Gently remind your spokesperson/PI to submit the UWS as shown above, or ask this person to nominate you via an email to the [mailto:[email protected]?subject=Delegate%20UWS CNM User Office].
# Continue at section [[#Actions required by the User]].


The User Office will notify the spokesperson once work on a proposal may begin.
For proposals already in progress, see the next section.


=== To get started as participating user ===
== Adding users to a proposal ==
:* Gently remind your spokesperson/PI to do the above, or ask this person to nominate you via an email to the [mailto:[email protected]?subject=Delegate%20UWS CNM User Office].
Most users are nominated to work on a proposal when it is first submitted.
:* Enter, review, or update your [https://beam.aps.anl.gov/pls/apsweb/ufr_main_pkg.usr_start_page '''User Registration.''']
Additional users can be authorized to work under a proposal at any time after acceptance.
Actions are needed from both the user and the proposal spokesperson.
 
=== Actions required by the User ===
:1. [[#Ongoing user access requirements|Enter or review your registration]] as facility user and update as necessary.
:2. Follow up in a day or a week as specified at [[#Follow-up actions required by the User]].
 
=== Actions required by the PI or spokesperson ===
:3. Confirm that the user has entered or reviewed their registration, and determine their badge number(s).
:4.  Add the user to the proposal. <!-- How to do this depends on whether the proposal [[#To get started as spokesperson or delegate|has already started or not (see above)]]. -->
:* If the proposal has '''not yet started:''' See the section above [[#To get started as spokesperson or delegate]].
:* If the proposal has '''already started:''' Email the users' names '''and badge numbers''' to the proposal's ''Scientific Contact'' at the CNM and ask for the UWS to be augmented.
<!-- If you have questions, please contact the [mailto:[email protected]?subject=Reinstate%20user%20account CNM User Office]. -->
 
=== Actions required by the Scientific Contact ===
: 5. Open the UWS-SciCon form in the  CNM Proposal Dashboard on the "Inside CNM" web page.
:* Add users under section "Participating Personnel".
:* Enter users' badge numbers which may have been left empty in earlier UWS submissions, as happens by necessity for very newly registered users.
:* Again, review that badge numbers are present and are correct for all relevant participants that are to work on ''Carbon'', or at least need to use {{sshgw}}.
 
=== Follow-up actions required by the User ===
: 6. Contact our [mailto:[email protected]?subject=Process%20new%20user%20registration CNM User Office] and ask to review your registration and to initiate the next steps.
:* To work onsite at CNM or remotely on Carbon, you will need an ''Argonne computer account'', which is different from the account you used to enter your User Registration details – see section [[#Account Types]].
:* If you are new to Argonne, our User Office will be in contact with you to set up this account. Since the Argonne account is for more general use than your registration account, it will involve formal approvals, cyber security training, etc. and typically takes about ''one week'' to complete.
: 7. Complete or refresh training courses, as requested by the User Office.
:* Course requirements are triggered at various levels. There are courses for being a visitor at Argonne, a user at CNM, and for work performed under proposals, be it onsite or remotely.
: 8. Have your Argonne computer account finalized and ready.
:* '''New users:''' See section [[#Finalize your Argonne computer account]].
:* '''Existing users:''' Check whether your password has expired and renew it if necessary; [[#Verify or change your password|see below]].
 
=== Carbon account and access ===
When all prerequisites are met, your Argonne account will be  enabled or (if so needed) re-enabled for use on Carbon.
 
: Enabling your account on Carbon <font color="red">''does not happen immediately''</font> after a password reset or rename, or submitting proposal forms – it is a separate process.
 
: 9. '''Await confirmation''' from us that your account has been enabled ''on Carbon'' and been added to one or more proposals.
: 10. Review and follow the instructions on [[HPC/Network Access]], i.e., how to connect to the cluster.
 
If you do not receive an expected notification within two days, contact our [mailto:[email protected]?subject=Process%20new%20user%20registration User Office] and describe your issue.
 
== Finalize your Argonne computer account ==
After your registration as a new Argonne Facility User has been processed, an [[#Account Types|Argonne "domain account"]] will be created for you.
Its user name will be in the form <code>b123456</code> and our User Office will have communicated to you an initial password.
 
To begin work on ''Carbon'' and possibly other CNM instruments, personalize both your ''password'' and your ''user name'', in this order.
 
; For Argonne personnel<nowiki>:</nowiki> User name vs. email address:
:* Your '''user name''' is the one by which you log in to Argonne-internal services, and takes one of three forms. Web login services typically indicate which user name form they expect.
:** <code>username</code> – This is the most typical form expected, used to log in to {{sshgw}}.
:** <code>[email protected]</code> – Your user name with a ''directory domain'' appended.
:** <code>ANL.GOV\username</code> – An alternate form of your user name and directory domain.
:* Your user name is '''not necessarily''' merely the name part of an Argonne '''email address''' that you will have if you work or study at Argonne.
<!-- name part of the address (before <code>@anl.gov</code>) may have been established by setting an email alias. -->
<!-- If you were to reverse the order, your account would look as being still unused or dormant, and be ineligible for use. -->
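
To illustrate how the three login forms relate, here is a minimal Python sketch that reduces any of them to the bare user name. The function name <code>bare_user_name</code> is hypothetical, and <code>directory.example</code> is a placeholder domain, not a real Argonne directory domain.

```python
def bare_user_name(login: str) -> str:
    """Reduce any of the three login forms to the bare user name."""
    login = login.strip()
    if "\\" in login:
        # DOMAIN\username form, e.g. ANL.GOV\username
        return login.split("\\", 1)[1]
    if "@" in login:
        # username@domain form (user name with a directory domain appended)
        return login.split("@", 1)[0]
    # Already the plain form, as used for the SSH gateway.
    return login


print(bare_user_name("jsmith"))
print(bare_user_name("ANL.GOV\\jsmith"))
print(bare_user_name("jsmith@directory.example"))
```

All three calls print the same bare name. The sketch only shows the relationship between the forms; each login page still expects the specific form it indicates.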
 
=== Change your password ===
* Change your password as instructed when your account was created.
*: See also section [[#Verify or change your password]] below.
* If you choose to call the Argonne Help Desk at [tel:+1-630-252-9999 +1-630-252-9999], option 4, to make the password change:
*: While on the phone regarding your password, state that you are a ''CNM Carbon User'' and ask to also ''personalize your user name''.
 
=== Change your user name ===
# Request a user name change, either over the phone or by email. We will choose the user name for you.
#; To request by phone: Call the Argonne Help Desk at [tel:+1-630-252-9999 +1-630-252-9999], and select option 4.
#; To request by email: Follow this link: [mailto:[email protected]?subject=User%20name%20change%20request&body=I%20am%20a%20new%20user%20of%20CNM%27s%20Carbon%20HPC%20system.%20Please%20personalize%20the%20user%20name%20of%20my%20Argonne%20Domain%20computer%20account.%0D%0A%0D%0AMy%20full%20name%3A%09__%0D%0AArgonne%20Badge%3A%09__%0D%0A%0D%0AThank%20you.%0D%0A%0D%0A '''Open email template'''] – (Use a web browser that supports [https://en.wikipedia.org/wiki/Mailto "mailto:" links].)
<!-- Wiki authoring note: Encode body text by:
    perl -MURI::Escape -p -e 's/\n/\r\n/; $_ = uri_escape($_);'
-->
## Respond from the ''same email address'' as you filled in at the CNM User Registration form.
##: In your email application, choose or switch to the ''mail account'' that handles the address you registered with at our User Registration. Messages pertaining to your Argonne activities that are sent from an unrelated email account may be ''disregarded'' (not taken up) by our help desk staff, for obvious reasons.
## Fill in your ''name'' and ''badge number'' in the email template that opens, then send the email.
<!-- ## Fill in the email template.
##: If you have a user name at your institution that you are happy with, supply that name as the first of three name preferences. This will make it simpler to configure your network connections to the cluster.
-->
<!-- ## Check your '''From:''' address again.
##: The email address you send from must match your User Registration, see above.
-->
<!-- # Request user names that have:
#* Lowercase letters '''a–z''', at most 8.
#* At most 2 numerals '''0–9''', preferably none.
#* '''No''' punctuation, including none of <code>.</code> <code>-</code> <code>_</code>, and no other special characters like <code>@,;:%&amp;+*=()[]{}\|/&lt;&gt;#'"`~!$^</code>.
-->
# '''Await an emailed ticket number''' from Argonne's service desk, or '''follow the instructions of the person''' you called.
#* Ticket numbers are assigned upon manual review, typically within an hour or so during business hours.
#* If you '''did not get a ticket number''' after a reasonable time, re-send the request, making sure you do so ''from the email account you provided at User Registration''. <!-- [mailto:[email protected]?subject=User%20name%20change%20request ask the CNM User Office] to review. -->
# Check your email for follow-up questions and respond.
# Await the completion notice for your user name change.


We will notify you once your account is ready and added to the respective proposals.
=== Next steps ===
''Carbon'' access <font color="red">''is not immediately enabled''</font> once your user name change request has been completed – it is a separate process.
# Expect an email that your ''Carbon'' access has been granted.
<!--
# Follow the instructions in that email to learn [[HPC/Network Access|'''how to log in''']] and where to find further documentation
-->
# Proceed under [[#Carbon account and access]].


== Proposal Troubleshooting ==
== Proposal troubleshooting ==
'''Email the [mailto:[email protected]?subject=Proposal%20Question CNM User Office]''' or [http://www.anl.gov/cnm/about-us/contact-us phone us] if you have any questions or concerns, such as:
'''Email the [mailto:[email protected]?subject=Proposal%20Question CNM User Office]''' with any questions or concerns, such as:
* about proposals,
* about proposals,
* about users, or
* about users, or
Line 33: Line 195:
'''Only our User Office staff''' is able to review all aspects of your proposal or your user access requirements, and determine any steps that have yet to be taken or need to be refreshed.
'''Only our User Office staff''' is able to review all aspects of your proposal or your user access requirements, and determine any steps that have yet to be taken or need to be refreshed.


 
Include in your message:
'''Include in your message:'''
* Proposal number(s),
* Proposal number(s),
* Name of the Principal Investigator (PI),
* Name of the Principal Investigator (PI),
* Names and badge numbers of participants.
* Names and badge numbers of participants.


== Adding users to a proposal ==
== Ongoing user access requirements ==
Most users are nominated to work on a proposal when it is first submitted.
For you to log into Carbon and its SSH gateway, a number of criteria must be met, most of which are subject to '''expiration dates''' and '''require action from you for renewal'''.
Additional users can be authorized to work under a proposal at any time after acceptance.
 
Actions are needed from both the user and the proposal spokesperson.
You will ''not'' get notified (for various reasons) about some of these expirations.
Different duration terms may cause expirations to happen right in the middle of your proposal's lifecycle and can cause immediate inconvenience.
Depending on which criterion expired, one or both of the following will happen to your Argonne computer account:
* The account's permission to use {{sshgw}} will be revoked.
* The account ''as such'' will be disabled, equivalent to revoking all its permissions.
 
To recover, do one or more of the following:
 
=== User Registration ===
# Review and update your [https://beam.aps.anl.gov/pls/apsweb/ufr_main_pkg.usr_start_page '''User Registration.''']
#* For this system, log in  with your ''badge number'' and the password associated with it, not your regular Argonne account – see section [[#Account Types]] above.
#* The registration itself requires renewal at least every 2 years, or earlier when any one of various prerequisite items expires.
#* For '''non-US citizens''' present in the US: Review and update registration items regarding your '''US visa''' or related work permit. A current US work or study status is required to access Argonne computers, the same as if you were to visit in person.
# Ask our [mailto:[email protected]?subject=Reinstate%20user%20account CNM User Office] staff to review your registration updates and to recreate or re-instate your Argonne account, as the case might be.
 
=== Legal User Agreement ===
* If you '''changed your affiliation''' (the institution where you work or study), check to see if your new institution has a legal [https://aps.anl.gov/Users-Information/Legal-Financial/Argonne-User-Facility-Agreements '''User Facility Agreement'''] with Argonne in place and request one if not.
*: Be advised that the process may take several weeks to percolate through legal and administrative channels.
 
=== Training ===
*Review and renew your [http://beam.aps.anl.gov/pls/apsweb/rt0004.intro_process '''User Courses.''']
*: For remote users, the ESH223 course "Cybersecurity Annual Education and Awareness" is the one most likely to be in need of renewal.
 
=== Password ===
* Review and update the [[#Verify or change your password|'''password''']] to your Argonne computer account  if you now have or previously had such an account.
*: If your account's password expired, we consider the account dormant and ''will not notify you about new proposals'' where you are listed as a participant.


; Actions required by the User:
=== Active proposal ===
:1. [[#Review access requirements|Enter or review your registration]] as facility user and make updates as necessary.
You must be a participant in at least '''one active or recently expired [https://www.anl.gov/cnm/become-a-cnm-user User Proposal]'''.
; Actions required by the PI or spokesperson:
You may run compute jobs under a given proposal while it is active, i.e., within the dates stated in the proposal's User Work Approval (UWA).
:2. Confirm that the user has entered or reviewed their registration, and determine their badge number(s).
* To review the dates for your proposals, ask your PI to search their email archive for subjects ''"Work Approval Received"'' or ''"Proposal Expiration"''.
:3.  Have the user added to the proposal. How to do this depends on whether the proposal [[#To get started as spokesperson or delegate|has already started or not (see above)]].
:* For proposals that have already started, submit the users' names and badge numbers to the proposal's ''Scientific Contact'' at the CNM.
:* If the proposal hasn't started yet, add the users ''yourself'' when you initially fill in the User Work Submittal (UWS) as part of the regular proposal startup process. The UWS can and must be submitted by the spokesperson, once only, to start a proposal. To make changes after your submission, ask your Scientific Contact at the CNM.
: Your ''Scientific Contact'' and the link to the UWS for each proposal are shown in our email to you with the subject containing ''User Proposal Status Notification''.


<!-- If you have questions, please contact the [mailto:cnm_useroffice@anl.gov?subject=Reinstate%20user%20account CNM User Office]. -->
=== Data-only access for inactive proposals ===
After your last active proposal on Carbon has expired you may still access,  for '''up to 30 days''', the SSH gateway and Carbon's login nodes.
Past that time window, your access will be revoked and '''your data may be deleted''', following [https://www.anl.gov/cnm/complete-your-work-at-cnm '''CNM's Data Retention Policy'''].  


== User access requirements ==
* While you still have access, offload from Carbon all your files and data that you may wish to keep.
A number of items are required from an individual person to log into ''Carbon'', most of which are subject to an expiration date and require periodic renewal.
*: The CNM cannot be expected to store your data beyond your access window and cannot be held responsible for it.
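
As a planning aid, the 30-day data-only window can be computed from the expiration date of your last active proposal. This is a minimal Python sketch under stated assumptions: the function names are hypothetical, the window is taken as exactly 30 calendar days, and the last day is treated as still accessible. The actual revocation date is determined by CNM, so offload well before the computed date.

```python
from datetime import date, timedelta

DATA_ACCESS_DAYS = 30  # "up to 30 days" after the last active proposal expires


def data_access_deadline(last_proposal_expiry: date) -> date:
    """Latest date on which data-only access might still be available."""
    return last_proposal_expiry + timedelta(days=DATA_ACCESS_DAYS)


def may_offload_data(today: date, last_proposal_expiry: date) -> bool:
    """True while 'today' is within the assumed data-only window."""
    return today <= data_access_deadline(last_proposal_expiry)


expiry = date(2024, 6, 1)  # example expiration date, chosen for illustration
print("Offload data by:", data_access_deadline(expiry).isoformat())
```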


* Review and update your [https://beam.aps.anl.gov/pls/apsweb/ufr_main_pkg.usr_start_page '''User Registration.''']
=== Practical hints ===
*: Your Argonne computer account will be disabled upon expiration of certain registration items. In particular, if you are not a US citizen, you will require a current '''US visa''' or related work permit to access Argonne computers, just as if you were to visit in person. This will very likely happen in the middle of your proposal's lifetime. For some of these expirations, no notice will be sent to you (for varying reasons), and you will suddenly find that you can no longer access mega.
* Set yourself '''calendar entries''' about one year into the future to remind yourself to renew any of your user registration or training requirements.
*: After you updated your user registration, contact the [mailto:[email protected]?subject=Reinstate%20user%20account CNM User Office] to have your Argonne account re-instated.
* After you were added to a user proposal, '''wait at least an hour''' before trying to access our SSH gateway, preferably until the next morning.
* Renew your [http://beam.aps.anl.gov/pls/apsweb/rt0004.intro_process '''User Courses.''']
*: Updates of your status need to be propagated through a handful of systems, each being done about hourly, so it may take several cycles for your status change to reach {{sshgw}}.
*: For remote users, the ESH223 course "Cybersecurity Annual Education and Awareness" is the one most likely in need of renewal.
* You must be a participant in at least one ''active or recently expired'' [http://nano.anl.gov/users/proposal_process_brief.html '''User Proposal'''].
*: To review dates for your proposal, ask your PI to search their email archive for mails with ''"Work Approval Received"'' or ''"Proposal Expiration"'' in the subject.
*: Compute jobs may be run while a proposal (specifically, the user work authorization) is active. For at least 30 days thereafter, users are entitled to ''data access'' only, following [http://nano.anl.gov/users/overview.html#Anchor15 CNM's Data Retention Policy].
* Access to mega requires that the User Work Submittal for a proposal contain your '''badge number'''.
*: This is a more arcane technical issue. If your badge number was left empty at the original submission of the UWS (typically when you are a newly registered user), ask the [mailto:[email protected]?subject=Update%20user%20in%20UWS CNM User Office] or your Scientific Contact to augment and resubmit the form.


== Login issues ==
== Login issues ==
When you ask "I cannot log in" or "My password does not work", consider the following sections:
When you ask "I cannot log in" or "My password does not work", consider the following sections:


=== Review host names ===
=== Review your access requirements ===
Make sure you connect to the correct host name, which is '''mega.cnm.anl.gov''' for the SSH gateway and '''carbon.cnm.anl.gov''' when connecting from an onsite work computer or over VPN.
See section [[#Ongoing user access requirements]] above.
The previous name for the latter was '''clogin.cnm.anl.gov''' and will continue to work.
 
-- See [[HPC/Network Access]].
=== Verify or change your password ===
Do '''one of the following:'''
* Visit '''https://mypassword.anl.gov/''' <!-- (https://servicenow.anl.gov/pr . Mnemonic: "pr" stands for '''p'''assword '''r'''eset). -->
** Complete the ''Password Enrollment'' process at this service on your first visit when you ''know'' your password. You will be (or have been) thus instructed when your account has been created.
** You can ''verify'' or ''change'' your existing known password.
** You can ''reset'' or ''unlock'' your password '''only''' after you enrolled.
<!-- #*: With enrollment completed, you can later reset your password, and also [[#Password lockouts|unlock your account]] in case it has been blocked after too many incorrect password attempts. -->
** You cannot use this service if your account has been ''disabled'' as such, in which case revisit the section [[#Ongoing user access requirements]].
* You can ''change'' your ''known'' password also at our SSH gateway host <code>mega</code>.
<!-- ** You will be asked to update an expired password upon logging in. -->
** At least one of your CNM proposals must still be active.
** You cannot log in if your account is ''disabled'' as such.
: Error messages for login failures will not disclose a specific reason. You cannot tell if merely your password was wrong or your account is disabled.
* If you cannot use any of the above systems:
** request a [mailto:[email protected]?subject=Password%20reset password reset at the Argonne Service Desk], or
** request a [mailto:[email protected]?subject=Requesting%20password%20reset password reset from the CNM User Office].
<!--# <font color="grey">Visit our older account profile service, being phased out in early 2021: '''https://mypassword.anl.gov/'''.
** You must have previously completed an Argonne Account Profile at this service.
** Profiles from this service were not transferred to our current account service above.</font> -->


=== Verify your password ===
After a password update, you should be able to connect to {{sshgw}} normally.
Visit https://credentials.anl.gov/ and verify that your username and password are correct.
If you find that you cannot log in, however, retry only '''after an hour or two''', as in some cases extra time is needed for passwords or other account changes to propagate.


=== Review your access requirements ===
=== Password lockouts ===
See section [[#User access requirements]] above.
After trying incorrect passwords several times, your account may be '''temporarily locked out.''' To recover, try one of the following:
* Wait '''approximately 30 min.'''
* Unlock at '''https://mypassword.anl.gov/'''<!-- https://servicenow.anl.gov/pr --> – available if you have previously enrolled there.


=== Request a password reset ===
=== Review host names ===
* To have your password reset, email the [http://nano.anl.gov/contact_us.html CNM User Office], at [mailto:cnm_useroffice@anl.gov cnm_useroffice@anl.gov].
Connect to the correct host names:
* When you connect to mega while your temporary password is still in place, mega will ask for a new password. You can safely change your password at this point.
* When connecting from outside Argonne, at least ''two ssh sessions'' are required.
* You can also change your password at https://credentials.anl.gov/. However, a change made there will take a few hours to become active on mega.
** {{sshgw|fqdn}} – an ssh "tunnel setup" connection.
** {{host|carbon}} – one or more "payload" connections for the ssh, scp, or sftp commands. Here, {{host|carbon}} is not a hostname but an entry in your ssh configuration that you must make (called a host alias or "profile", depending on the ssh application). The entry stands for a connection to localhost (your machine), at a port number forwarded by the preceding tunnel setup connection.
* When onsite (for any user) or using VPN (for Argonne staff):
** {{host|carbon.cnm.anl.gov}}. The previous name was {{host|clogin. …}}.
To learn more, read [[HPC/Network Access]].
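As a rough illustration of the two-session scheme described above, the following sketch prints an ssh configuration entry for the <code>carbon</code> alias and the matching tunnel command, without opening any connection. The local port number 33301, the user name <code>jsmith</code>, and the choice of forwarding target are assumptions made for this example only; the authoritative values are given at [[HPC/Network Access]].

```shell
#!/bin/sh
# Sketch only. ASSUMPTIONS (not taken from this page): local_port, user,
# and the use of port 22 on carbon.cnm.anl.gov as the forwarding target.
gateway="mega.cnm.anl.gov"    # SSH gateway, used by the "tunnel setup" session
target="carbon.cnm.anl.gov"   # login host name as used for onsite access
local_port=33301              # hypothetical free port on your own machine
user="jsmith"                 # hypothetical personalized user name

# Print the host alias entry for review; nothing is written to disk.
carbon_ssh_config() {
    cat <<EOF
Host carbon
    HostName localhost
    Port ${local_port}
    User ${user}
EOF
}

# Print (do not run) the command for session 1, the tunnel setup.
tunnel_command() {
    echo "ssh -N -L ${local_port}:${target}:22 ${user}@${gateway}"
}

carbon_ssh_config
tunnel_command
# Session 2 (payload), once the entry above is in ~/.ssh/config:
#   ssh carbon
```

With such an entry in place, <code>scp</code> and <code>sftp</code> can use the same <code>carbon</code> alias for as long as the tunnel session stays open.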


=== Review network configuration ===
* Read again [[HPC/Network Access]], and follow the instructions for your platform.


=== Practical hints ===
=== Request support ===
* After you were added to a user proposal, wait at least an hour before trying to access mega, preferably until the next morning.
* If the above sections did not help you to resolve your issue, [[HPC/Support|submit a support request]].
*: Updates of your status need to be propagated through a handful of systems, each being done about hourly, so it may take several cycles for your status change to reach mega.
 
* Set yourself '''calendar entries''' about one year into the future to remind yourself to renew any of your user registration or training requirements.
== Expected files or commands are unavailable ==
When you cannot create or edit files or directories under your home directory, or you find that commands you expect are not found, then:
 
# Inspect the beginning of your command line, called the ''prompt'':
#* If your prompt mentions "@mega" or "GATEWAY", e.g.:
#:: <code>''username''<font color="brown">'''@mega5 CNM-GATEWAY'''</font> $ </code>
#: then you are working '''not on Carbon just yet''', but rather on its SSH gateway. Visit [[HPC/Network_Access]] to learn how to connect properly.
#* If the prompt mentions "@login", e.g.:
#:: <code>''username''<font color="brown">'''@login6'''</font> … $</code>
#: then there may be a temporary file system issue which may resolve after a few minutes.
#: If such an issue persists or recurs over several hours, however, read the message that will have appeared right after you logged in, above the first command prompt. To locate these messages, scroll up the content of your terminal application window, or log out and log back in to bring up these messages again.
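The prompt check above can also be scripted. The following is a minimal sketch that classifies a short host name by the prefixes shown in the example prompts (<code>mega</code> and <code>login</code>); the function name <code>classify_host</code> and its output labels are made up for this illustration.

```shell
#!/bin/sh
# Classify a short host name as it appears after "@" in the prompt.
# classify_host and its labels are hypothetical helpers for this sketch.
classify_host() {
    case "$1" in
        mega*)  echo "gateway" ;;       # e.g. mega5: the SSH gateway, not Carbon
        login*) echo "carbon-login" ;;  # e.g. login6: a Carbon login node
        *)      echo "unknown" ;;
    esac
}

# Short name of the machine this shell is running on.
here=$(hostname 2>/dev/null || echo unknown)
here=${here%%.*}   # strip any domain part
echo "${here}: $(classify_host "$here")"
```

A result of <code>gateway</code> means that the connection ended at the SSH gateway and that the second ("payload") connection has yet to be made.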
 
=== My home directory is read-only ===
=== I cannot edit my .bashrc file ===
=== Expected commands unavailable ===
In all these situations, see [[#Expected files or commands are unavailable | the beginning of the current section]].


== Mailing lists ==
Announcements about Carbon are made on the [https://lists.anl.gov/mailman/listinfo/cnm-hpc-announce ''cnm-hpc-announce'' mailing list], hosted at Argonne. These list pages and the archive are, unfortunately, only accessible from onsite or via VPN.
Announcements about Carbon are made on the [https://lists.anl.gov/mailman/listinfo/cnm-hpc-announce ''cnm-hpc-announce'' mailing list], hosted at Argonne.
The mailing list home page and its archive are, unfortunately, only accessible from onsite or (for authorized users) over VPN.


* To '''update your email address''' on the list, simply unsubscribe as shown in the next item, then re-subscribe by sending a blank message to [mailto:cnm-hpc-announce-join@lists.anl.gov cnm-hpc-announce'''-join'''@lists.anl.gov].
* To '''unsubscribe''' from the mailing list, do one of the following:
* To '''unsubscribe''' from the mailing list, send a blank message to [mailto:cnm-hpc-announce-leave@lists.anl.gov cnm-hpc-announce'''-leave'''@lists.anl.gov] and follow up on the confirmation notice.
*# Open the "unsubscribe" link at the bottom of a recent message that you received from the mailing list.
*: See the [http://www.gnu.org/software/mailman/mailman-member/node22.html GNU Mailman] documentation for background.
*# Send a blank message to [mailto:cnm-hpc-announce-leave@lists.anl.gov cnm-hpc-announce'''-leave'''@lists.anl.gov] and follow up on the confirmation notice.
* To '''subscribe''' to the mailing list, send a blank message to [mailto:cnm-hpc-announce-join@lists.anl.gov cnm-hpc-announce'''-join'''@lists.anl.gov] and follow up on the confirmation notice.
* To '''change your email address''' on the list, simply do both of the above, in order.
 
; Hints:
* When sending commands by email, be sure to have the relevant email account selected in the "From" line of the compose window in your email application.
* Inspect the '''Junk''' mail folder of your email application if you do not receive a confirmation message for subscribing or unsubscribing within a minute or so.
* See the [http://www.gnu.org/software/mailman/mailman-member/node22.html GNU Mailman] documentation for background.


== Applications ==
Line 165: Line 390:
To get allocation details for accounts that have CreditLimits, run the <code>gbalance</code> command. Pass <code>-u ''username''</code> or <code>-p ''projectname''</code> to select your allocations:
  gbalance -h -u $USER
: Use the literal string <code>$USER</code> which makes the shell fill in your actual username.
: Type <code>$USER</code> ''as shown''. The command interpreter will fill in your actual username.
The output looks like:
  Id  Name    Amount    Reserved Balance    CreditLimit '''Available'''
Line 178: Line 403:
; Name: The project name (for use with <code>qsub -A</code> or <code>#PBS -A</code>).
; Amount: Amount for transactions completely on the books for the project account; does not include running jobs or credits. Deposits are allocated by the User Office and implemented by the Carbon administrator.
; Reserved: Amounts held in reserve by all running jobs using this account. The reserve ensures that a job does not cause an overdraft when it finishes and when its actual use will be booked. The quantity is calculated by walltime * number of cores blocked. When a job terminates, the charge according to the actual time used will be ''subtracted'' from Amount, and the unused quantities will be ''re-added'' to Amount.
; Reserved: Amounts held in reserve by all running jobs using this account. The reserve ensures that a job does not cause an overdraft when it finishes and when its actual use will be booked. The quantity is calculated by walltime * number of cores blocked. When a job terminates, the charge according to the actual time used will be ''subtracted'' from Amount, and the unused quantities will be ''re-added'' to Available.
; Balance: Available for new jobs; may go negative if CreditLimits are in place.
  Balance = Amount - Reserved
Line 184: Line 409:
; Available: Relevant quantity for new jobs. Must be positive for a new job to start, and large enough to Reserve the entire job.
  Available = Balance + CreditLimit
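As a worked example of the two formulas above, the following sketch uses made-up figures and checks whether a new job could reserve its full amount. All numbers are hypothetical, and the unit (core-hours) is an assumption of this example.

```shell
#!/bin/sh
# Hypothetical figures, assumed here to be in core-hours.
amount=1000        # Amount: transactions already on the books
reserved=300       # Reserved: held by currently running jobs
credit_limit=0     # CreditLimit: none granted in this example

balance=$((amount - reserved))          # Balance   = Amount - Reserved
available=$((balance + credit_limit))   # Available = Balance + CreditLimit

# Reserve needed by a new job: walltime * number of cores blocked.
walltime_hours=24
cores=16
needed=$((walltime_hours * cores))

# Available must be positive and large enough to reserve the entire job.
if [ "$available" -gt 0 ] && [ "$available" -ge "$needed" ]; then
    verdict="job can start"
else
    verdict="job must wait"
fi
echo "Balance=$balance Available=$available Needed=$needed: $verdict"
```

With these figures, Balance and Available are both 700 and the job needs 384, so the job can start.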
== Why did my account balance suddenly drop? ==
The compute time physically available on Carbon's processors is a "perishable" resource: CPU-hours pass by whether or not they are used.
Therefore, your allocations are time-restricted in a "use-it-or-lose-it" manner.
This is done to encourage consistent use of the machine throughout allocation cycles.
Otherwise, resource contention becomes likely near the end of an allocation cycle, one year after each batch of proposals has started. At that point it becomes increasingly impractical, and eventually physically impossible, to use up a large remaining allocation within a short time.
You would need to claim a large fraction of Carbon's nodes during a relatively short time window, which is unlikely to be possible because jobs from other users will be running as well.
=== Allocation expiration schedule ===
Unused allocations ''will expire'', according to the following schedule:
* Your allocation is provided in three equal-sized installments.
* All installments are active from the beginning.
* Installments ''expire'' in a staggered fashion, currently after 4, 8, and 12 months, respectively. A diagram might illuminate this:
    Proposal                Proposal
      start                expiration
        |-------|-------|-------|------> Time
        0       4       8       12      (months)
        |
   Installment
        |
    (1) |########....................              KEY
        |
    (2) |################............              .  Installment is inactive
        |
    (3) |########################....              #  Installment is active
        |
* Your jobs will, sensibly, be booked against the installments that expire the earliest.
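To illustrate why a balance can drop at an expiration date, the following sketch computes what is lost when the first installment expires after 4 months. The allocation size and the usage figure are hypothetical; only the three-installment rule and the earliest-expiring-first booking are taken from the schedule above.

```shell
#!/bin/sh
total=90000                      # hypothetical total allocation
installment=$((total / 3))       # three equal-sized installments
used_by_month_4=12000            # hypothetical usage during months 0 to 4

# Jobs are booked against the earliest-expiring installment first, so only
# the unused part of installment (1) is lost when it expires.
expired=$((installment - used_by_month_4))
if [ "$expired" -lt 0 ]; then
    expired=0                    # installment (1) was used up completely
fi

remaining=$((total - used_by_month_4 - expired))
echo "expired at month 4: $expired; remaining afterwards: $remaining"
```

In this example 18000 units expire unused at month 4, and the 60000 units of installments (2) and (3) remain.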


== My question is not answered here ==
See [[HPC/Support]].

Latest revision as of 14:10, November 7, 2024

Access and proposals

The HPC Carbon cluster and its associated software licenses may be accessed, like any tool at the CNM, by one or both of the following routes:

  • CNM Facility Users are eligible under CNM's User Access Program. Follow the link to learn how to initiate access under this route. Begin by registering as Facility User, then submit a proposal.
  • Affiliates of Argonne's NST division are eligible for discretionary access. Affiliate here refers to regular employees, Argonne/University joint appointees, visitor appointees, and student group members, being supervised by CNM staff.

The following is a summary of the main points as they pertain to Carbon.

Legal User Agreement

The institution where you work or study must have a legal signed User Facility Agreement with Argonne in place before any work under an eventual proposal may be performed.

Look up existing User Facility Agreements and locate your institution if you have never worked with Argonne before, or you recently changed your affiliation, or are going to change it soon.

If your institution is not shown, get started with establishing a user agreement as soon as possible. Legal wheels turn slowly and your scientific work could be delayed.

User Registration

Under CNM's User Access Program, your starting point is to become a CNM user.

Proposal Lifecycle Overview

  1. To submit a proposal, follow the instructions at the CNM's Call for Proposals.
  2. Upon successful review of a proposal and grant of an allocation, the contacting author will receive an email with subject User Proposal Status Notification.
  3. The contacting author must submit, for each proposal, a Safety&Data Form as instructed in the status email. This will confirm the names and badge numbers of participating users, as well as various characteristics of the proposal that we need for statistical reporting to our funding agencies.
  4. Participating users register and respond at various steps to have their Argonne user accounts created, activated or reactivated as needed, and have them authorized to access Carbon.
  5. Participating users perform the work proposed.
  6. Credit CNM in publications that result from your proposal work with us.
  7. After the proposal expires, our User Office will ask you to please submit:
    • An approximately 1-page Activity Report summarizing work performed and perhaps already published under the proposal. We use these reports to document the use of our facilities to funding agencies.
    • An end-of-proposal Satisfaction Survey, to invite your feedback on our process.

Account Types

You access our computer systems by two different user accounts:

User Registration Account

– Looks like: 123456

Your User Registration Account:

  • … is created when you register as CNM facility user, be it as participant or as Principal Investigator (PI), at the User Registration for the Advanced Photon Source (APS) and CNM.
  • … has an all-numeric user name (no letters, no leading zeroes), which never changes – this is in fact your Argonne Badge Number,
  • … has its own password and password reset process at APS,

Use your User Registration Account to:

Argonne Domain Account

– Looks like: b123456 or jsmith

Your Argonne Domain Account:

  •  … is created when you first need it, such as when you are a PI and have received the go-ahead on your first proposal with us,
  • … has, at first, a user name beginning with b, followed by your badge number (without any leading zeroes),
  • … will get, following your request, a personalized user name, such as jsmith
  • … has its own password and uses its own password reset and recovery.

Use your Argonne Domain Account to:

  • … submit the Safety&Data Form (UWS) of each proposal for which you are the PI or delegated spokesperson,
  • … perform work – always remotely – at CNM's High Performance Computing system Carbon,
  • … handle almost all other tasks at Argonne.

For assistance, please contact our CNM User Office. – Specify which task you have difficulties with.

Getting started on a proposal

To start working on a proposal once it has been granted, action is required for each proposal by the proposal spokesperson (usually the Principal Investigator, PI), regardless of any previous proposals with us. At this stage, we collect and update information about each proposal for reporting purposes to our funding agencies, and ensure that safety and training requirements are met by all participating users.

To get started as spokesperson or delegate

  1. Locate the email to you with subject User Proposal Status Notification, sent by the CNM User Office.
    • If you are delayed in starting your proposal, you may need to expand your mailbox search time frame, to begin the search at possibly several months before present.
    • You may need to inspect the "Junk" or "Spam" folder of your mail application.
  2. Follow the instructions in that email.
    • Certain steps are needed only once because they pertain to your institution, notably filing a User Agreement.
    • Before work on a new proposal may begin, we may ask you to submit Activity Reports that are due for previous proposals upon their conclusion.
    • Fill in the Safety and Data form, i.e., follow the User Work Submittal (UWS) link that is given in our notification email.
    • Only the spokesperson may fill in the UWS form, or the PI may designate a delegate. Changes after the initial UWS submission by you (the PI or delegate) can only be made by your Scientific Contact at the CNM – see below.
    • The UWS is due for each proposal, as it collects certain metadata that we need, on a per-proposal-basis, for usage reporting to our funding agencies.
    • Confirm or add users in section "Personnel Participating In User Proposal" of the UWS form. Enter the badge numbers for all participating users, as far as they are known at submission time.
  3. Direct all users who you expect to access Carbon, possibly including yourself, to follow the instructions in section #Ongoing user access requirements.
  4. Be patient. UWS Processing typically takes at least one business day, as both our User Office staff and your Scientific Contact need to attend to your submission in person.
  5. The User Office will notify the spokesperson or delegate once work on a proposal may begin, issuing what we call the User Work Approval (UWA). It will be valid for one year for regular proposals, and 6 months for rapid-access proposals.

To get started as participating user

For new proposals or users new to CNM:

  1. Gently remind your spokesperson/PI to submit the UWS as shown above, or ask this person to nominate you via an email to the CNM User Office.
  2. Continue at section #Actions required by the User.

For proposals already in progress, see the next section.

Adding users to a proposal

Most users are nominated to work on a proposal when it is first submitted. Additional users can be authorized to work under a proposal at any time after acceptance. Actions are needed from both the user and the proposal spokesperson.

Actions required by the User

1. Enter or review your registration as facility user and update as necessary.
2. Follow up in a day or a week as specified at #Follow-up actions required by the User.

Actions required by the PI or spokesperson

3. Confirm that the user has entered or reviewed their registration, and determine their badge number(s).
4. Add the user to the proposal.
  • If the proposal has not yet started: See the section above #To get started as spokesperson or delegate.
  • If the proposal has already started: Email the users' names and badge numbers to the proposal's Scientific Contact at the CNM and ask for the UWS to be augmented.

Actions required by the Scientific Contact

5. Open the UWS-SciCon form in the CNM Proposal Dashboard on the "Inside CNM" web page.
  • Add users under section "Participating Personnel".
  • Enter users' badge numbers which may have been left empty in earlier UWS submissions, as happens by necessity for very newly registered users.
  • Again, review that badge numbers are present and are correct for all relevant participants that are to work on Carbon, or at least need to use mega.

Follow-up actions required by the User

6. Contact our CNM User Office and ask to review your registration and to initiate the next steps.
  • To work onsite at CNM or remotely on Carbon, you will need an Argonne computer account, which is different from the account you used to enter your User Registration details – see section #Account Types.
  • If you are new to Argonne, our User Office will be in contact with you to set up this account. Since the Argonne account is for more general use than your registration account, it will involve formal approvals, cyber security training, etc. and typically takes about one week to complete.
7. Complete or refresh training courses, as requested by the User Office.
  • Course requirements are triggered at various levels. There are courses for being a visitor at Argonne, a user at CNM, and for work performed under proposals, be it onsite or remotely.
8. Have your Argonne computer account finalized and ready.

Carbon account and access

When all prerequisites are met, your Argonne account will be enabled or (if so needed) re-enabled for use on Carbon.

Enabling your account on Carbon does not happen immediately after a password reset or rename, or submitting proposal forms – it is a separate process.
9. Await confirmation from us that your account has been enabled on Carbon and been added to one or more proposals.
10. Review and follow the instructions on HPC/Network Access, i.e., how to connect to the cluster.

If you do not receive an expected notification within two days, contact our User Office and describe your issue.

Finalize your Argonne computer account

After your registration as a new Argonne Facility User has been processed, an Argonne "domain account" will be created for you. Its user name will be in the form b123456 and our User Office will have communicated to you an initial password.

To begin work on Carbon and possibly other CNM instruments, personalize both your password and your user name, in this order.

For Argonne personnel: User name vs. email address
  • Your user name is the one by which you log in to Argonne-internal services, and takes one of three forms. Web login services typically indicate which user name form they expect.
    • username –  This is the most typical form expected, used to log in to mega.
    • [email protected] – Your user name with a directory domain appended.
  • ANL.GOV\username – An alternate form of your user name and directory domain.
  • Your user name is not necessarily merely the name part of an Argonne email address that you will have if you work or study at Argonne.

Change your password

  • Change your password as instructed when your account got created.
    See also section #Verify or change your password below.
  • If you chose to call the Argonne Help Desk at +1-630-252-9999, option 4, to make the password change:
    While on the phone regarding your password, state that you are a CNM Carbon User and ask to also personalize your user name.

Change your user name

  1. Request a user name change, either over the phone or by email. We will choose the user name for you.
    To request by phone
    Call the Argonne Help Desk at +1-630-252-9999, and select option 4.
    To request by email
    Follow this link: Open email template – (Use a web browser that supports "mailto:" links.)
    1. Respond from the same email address as you filled in at the CNM User Registration form.
      In your email application, choose or switch to the mail account that handles the address you registered with at our User Registration. Messages pertaining to your Argonne activities that are sent from an unrelated email account may be disregarded (not taken up) by our help desk staff, for obvious reasons.
    2. Fill in your name and badge number in the email template that opens, then send the email.
  2. Await an emailed ticket number from Argonne's service desk, or follow the instructions of the person you called.
    • Ticket numbers are assigned upon manual review, typically within an hour or so during business hours.
    • If you did not get a ticket number after such reasonable time, re-send the request, making sure you do so from the email account you provided at User Registration.
  3. Check your email for follow-up questions and respond.
  4. Await the completion notice for your user name change.

Next steps

Carbon access is not immediately enabled once your user name change request has been completed – it is a separate process.

  1. Expect an email that your Carbon access has been granted.
  2. Proceed under #Carbon account and access.

Proposal troubleshooting

Email the CNM User Office with any questions or concerns, such as:

  • about proposals,
  • about users, or
  • you find that a response to any of your submissions or previous communications is taking longer than a few business days.

Only our User Office staff is able to review all aspects of your proposal or your user access requirements, and determine any steps that have yet to be taken or need to be refreshed.

Include in your message:

  • Proposal number(s),
  • Name of the Principal Investigator (PI),
  • Names and badge numbers of participants.

Ongoing user access requirements

For you to log into Carbon and its SSH gateway, a number of criteria must be met, most of which are subject to expiration dates and require action from you for renewal.

You will not get notified (for various reasons) about some of these expirations. Different duration terms may cause expirations to happen right in the middle of your proposal's lifecycle and can cause immediate inconvenience. Depending on which criterion expired, one or both of the following will happen to your Argonne computer account:

  • The account's permission to use mega will be revoked.
  • The account as such will be disabled, equivalent to revoking all its permissions.

To recover, do one or more of the following:

User Registration

  1. Review and update your User Registration.
    • For this system, log in with your badge number and the password associated with it, not your regular Argonne account – see section #Account Types above.
    • The registration itself requires renewal at least every 2 years, or earlier when any one of various prerequisite items expires.
    • For non-US citizens present in the US: Review and update registration items regarding your US visa or related work permit. A current US work or study status is required to access Argonne computers, the same as if you were to visit in person.
  2. Ask our CNM User Office staff to review your registration updates and to recreate or re-instate your Argonne account, as the case might be.

Legal User Agreement

  • If you changed your affiliation (the institution where you work or study), check to see if your new institution has a legal User Facility Agreement with Argonne in place and request one if not.
    Be advised that the process may take several weeks to percolate through legal and administrative channels.

Training

  • Review and renew your User Courses.
    For remote users, the ESH223 course "Cybersecurity Annual Education and Awareness" is the one most likely to be in need of renewal.

Password

  • Review and update the password to your Argonne computer account if you now have or previously had such an account.
    If your account's password expired, we consider the account dormant and will not notify you about new proposals where you are listed as a participant.

Active proposal

You must be a participant in at least one active or recently expired User Proposal. You may run compute jobs under a given proposal while it is active, i.e., within the dates stated in the proposal's User Work Approval (UWA).

  • To review the dates for your proposals, ask your PI to search their email archive for subjects "Work Approval Received" or "Proposal Expiration".

Data-only access for inactive proposals

After your last active proposal on Carbon has expired you may still access, for up to 30 days, the SSH gateway and Carbon's login nodes. Past that time window, your access will be revoked and your data may be deleted, following CNM's Data Retention Policy.

  • While you still have access, offload from Carbon all your files and data that you may wish to keep.
    The CNM cannot be expected or held responsible to store your data beyond your access window.

Practical hints

  • Set yourself calendar entries about one year into the future to remind yourself to renew any of your user registration or training requirements.
  • After you were added to a user proposal, wait at least an hour before trying to access our SSH gateway, preferably until the next morning.
    Updates of your status need to be propagated through a handful of systems, each being done about hourly, so it may take several cycles for your status change to reach mega.

Login issues

When you ask "I cannot log in" or "My password does not work", consider the following sections:

Review your access requirements

See section #Ongoing user access requirements above.

Verify or change your password

Do one of the following:

  • Visit https://mypassword.anl.gov/
    • Complete the Password Enrollment process at this service on your first visit when you know your password. You will be (or have been) thus instructed when your account has been created.
    • You can verify or change your existing known password.
    • You can reset or unlock your password only after you have enrolled.
    • You cannot use this service if your account has been disabled as such, in which case revisit the section #Ongoing user access requirements.
  • You can change your known password also at our SSH gateway host mega.
    • At least one of your CNM proposals must still be active.
    • You cannot log in if your account is disabled as such.
Error messages for login failures will not disclose a specific reason. You cannot tell whether your password was merely wrong or your account is disabled.

After a password update, you should be able to connect to mega normally. If you find that you cannot log in, however, retry only after an hour or two, as in some cases extra time is needed for passwords or other account changes to propagate.

Password lockouts

After trying incorrect passwords several times, your account may be temporarily locked out. To recover, try one of the following:

Review host names

Connect to the correct host names:

  • When connecting from outside Argonne, at least two ssh sessions are required.
    • mega.cnm.anl.gov – an ssh "tunnel setup" connection.
    • carbon – one or more "payload" connections for the ssh, scp, or sftp commands. Here, carbon is not a hostname but an entry in your ssh configuration that you must make (called a host alias or "profile", depending on the ssh application). The entry stands for a connection to localhost (your machine), at a port number forwarded by the preceding tunnel setup connection.
  • When onsite (for any user) or using VPN (for Argonne staff):
    • carbon.cnm.anl.gov. The previous name was clogin. ….

To learn more, read HPC/Network Access.

Review network configuration

Request support

Expected files or commands are unavailable

When you cannot create or edit files or directories under your home directory, or you find that commands you expect are not found, then:

  1. Inspect the beginning of your command line, called the prompt:
    • If your prompt mentions "@mega" or "GATEWAY", e.g.:
    username@mega5 CNM-GATEWAY $
    then you are working not on Carbon just yet, but rather on its SSH gateway. Visit HPC/Network_Access to learn how to connect properly.
    • If the prompt mentions "@login", e.g.:
    username@login6 … $
    then there may be a temporary file system issue which may resolve after a few minutes.
    If such an issue persists or recurs over several hours, however, read the message that will have appeared right after you logged in, above the first command prompt. To locate these messages, scroll up the content of your terminal application window, or log out and log back in to bring up these messages again.

My home directory is read-only

I cannot edit my .bashrc file

Expected commands unavailable

In all these situations, see the beginning of the current section.

Mailing lists

Announcements about Carbon are made on the cnm-hpc-announce mailing list, hosted at Argonne. The mailing list home page and its archive are, unfortunately, only accessible from onsite or (for authorized users) over VPN.

  • To unsubscribe from the mailing list, do one of the following:
    1. Open the "unsubscribe" link at the bottom of a recent message that you received from the mailing list.
    2. Send a blank message to cnm-hpc-announce-leave@lists.anl.gov and follow up on the confirmation notice.
  • To subscribe to the mailing list, send a blank message to cnm-hpc-announce-join@lists.anl.gov and follow up on the confirmation notice.
  • To change your email address on the list, simply do both of the above, in order.
Hints
  • When sending commands by email, be sure to have the relevant email account selected in the "From" line of the compose window in your email application.
  • Inspect the Junk mail folder of your email application if you do not receive a confirmation message for subscribing or unsubscribing within a minute or so.
  • See the GNU Mailman documentation for background.

Applications

I'd like to use application X

Check if the application is already available on Carbon

Run either of the following commands:

module avail
module -l avail 2>&1 | less
The second form gives you browsable output.
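
To look for one particular application rather than browse the whole list, the listing can be filtered. This is a sketch only: module_search is a hypothetical helper name, and it assumes that the module command is already set up in your shell, as it is in a login session.

```shell
# Hypothetical helper: case-insensitive search of the module listing.
# The 2>&1 mirrors the second command above; it sends the listing through
# the pipe in case "module avail" writes to standard error.
module_search() {
    module avail 2>&1 | grep -i -- "$1"
}

# Example use on Carbon (prints matching lines, if any):
#   module_search espresso
```

No output means that no module name matched the search term; try a shorter or alternative spelling before concluding that the application is not installed.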
If you cannot find the application on Carbon
  • Submit a support request.
    • Provide one or more URLs relevant to software you have in mind – be specific.
    • Describe the problem you are trying to solve – it may well be that we can suggest an alternative solution.
    • Give the extent of your planned use.
If you see the application on Carbon but you cannot access it
  • Existing license agreements may cover only a subset of users (typically Argonne employees).
  • If you feel you are eligible, submit a support request.
If a version newer than the installed one on Carbon is available

How do I run application X?

  • Customize your shell environment to load the application module.
  • Learn about module conventions on Carbon.
  • To determine the names of a package's executable scripts and binaries, load the application module (if you have not yet done so in your shell setup), then inspect the module's $NAME_HOME/bin directory. For instance, for the Quantum-ESPRESSO package:
module load quantum_espresso
ls $QUANTUM_ESPRESSO_HOME/bin

How do I use application X?

Read the package's documentation, using one or more of the following:

  • Inspect the package's $NAME_HOME/share or $NAME_HOME/doc directory on Carbon (see module conventions).
  • Browse the package's web page, generally mentioned in the module help text or the application catalog entry.
  • Consult a package's man pages. Few packages have them. Man page files are generally installed under $NAME_HOME/man or $NAME_HOME/share/man and if so, will be made available automatically to the man command.

What's my account balance?

Simple answer: mybalance

To find out how many core-hours you have available, the simplest command to run is:

mybalance -h
Project  Machines Balance    
-------- -------- ---------- 
user     ANY         993.26
cnm34567 ANY       158760.93
cnm31234 ANY      -148893.62

The table lists all the Projects you have access to (for use with the qsub -A argument) and their balances. Machines lists all systems that can book jobs against your allocations; Carbon is currently the only machine that can do so. Balance is your account balance in core-hours, as selected by the -h command option. This is the most useful and recommended unit. Without -h, you get core-seconds, which are integers but rather unwieldy numbers.

  • The "user" project provides you with a small initial startup allocation of typically 1000 core-hours.
  • When a Balance is reported as negative, that account typically has a CreditLimit assigned, which permits the balance to dip below zero. These details, however, are not shown by mybalance.
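
The two units mentioned above differ by a factor of 3600. As an illustration, a small helper (the name core_seconds_to_hours is hypothetical) converts a core-seconds figure into core-hours:

```shell
# Convert core-seconds to core-hours (1 core-hour = 3600 core-seconds).
core_seconds_to_hours() {
    awk -v s="$1" 'BEGIN { printf "%.2f\n", s / 3600 }'
}

# Example: 3575736 core-seconds are the 993.26 core-hours of the
# "user" project in the sample output above.
core_seconds_to_hours 3575736
```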

Complete answer: gbalance

To get allocation details for accounts that have CreditLimits, run the gbalance command. Pass -u username or -p projectname to select your allocations:

gbalance -h -u $USER
Type $USER as shown. The command interpreter will fill in your actual username.

The output looks like:

Id  Name     Amount     Reserved Balance    CreditLimit Available
--- -------- ---------- -------- ---------- ----------- --------- 
100 cnm31234 -148893.62     0.00 -148893.62   150000.00   1106.38
217 kpelzer      993.26     0.00     993.26        0.00    993.26 
123 cnm34567  166440.93  7680.00  158760.93        0.00 158760.93 

The most relevant column for you is Available. The units, given the -h option, are again core-hours.

The columns and their meanings are:

Id
An internal number for the account.
Name
The project name (for use with qsub -A or #PBS -A).
Amount
Amount for transactions completely on the books for the project account; does not include running jobs or credits. Deposits are allocated by the User Office and implemented by the Carbon administrator.
Reserved
Amounts held in reserve by all running jobs using this account. The reserve ensures that a job does not cause an overdraft when it finishes and its actual use is booked. The quantity is calculated as walltime * number of cores blocked. When a job terminates, the charge for the actual time used is subtracted from Amount, and the unused part of the reserve is returned to Available.
Balance
Available for new jobs; may go negative if CreditLimits are in place.
Balance = Amount - Reserved
CreditLimit
Amount by which Balance may go negative; assigned by the Carbon administrator.
Available
Relevant quantity for new jobs. Must be positive for a new job to start, and large enough to Reserve the entire job.
Available = Balance + CreditLimit
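
The column relations above can be checked against the sample output. The sketch below recomputes Balance and Available from the Amount, Reserved, and CreditLimit columns, and adds a check of whether a job would fit into the Available amount. The function names recompute and job_fits are hypothetical, and the rows are copied from the sample output rather than read from a live gbalance call.

```shell
# Sample rows copied from the gbalance output above:
# Id Name Amount Reserved Balance CreditLimit Available
sample='100 cnm31234 -148893.62 0.00 -148893.62 150000.00 1106.38
217 kpelzer 993.26 0.00 993.26 0.00 993.26
123 cnm34567 166440.93 7680.00 158760.93 0.00 158760.93'

# Recompute Balance = Amount - Reserved and Available = Balance + CreditLimit.
# Prints: Name Balance Available
recompute() {
    awk '{ balance = $3 - $4; available = balance + $6
           printf "%s %.2f %.2f\n", $2, balance, available }'
}

# Hypothetical check: can a job start, given AVAILABLE core-hours?
# The job reserves walltime * cores; Available must be positive and
# large enough to cover that reserve.
job_fits() {  # usage: job_fits AVAILABLE CORES WALLTIME_HOURS
    awk -v a="$1" -v c="$2" -v h="$3" 'BEGIN { exit !(a > 0 && c * h <= a) }'
}

printf '%s\n' "$sample" | recompute
```

With the 1106.38 core-hours available to cnm31234, for instance, job_fits 1106.38 8 100 succeeds (800 core-hours reserved), while job_fits 1106.38 16 100 fails (1600 core-hours would be needed).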

Why did my account balance suddenly drop?

The compute time physically provided by Carbon's processors is a "perishable" resource: CPU-hours pass by whether or not they are used. Therefore, your allocations are time-restricted in a "use-it-or-lose-it" manner. This is done to encourage consistent use of the machine throughout allocation cycles.

Otherwise, resource contention becomes likely near the end of an allocation cycle, one year after each batch of proposals has started. At that point it would be increasingly impractical, and eventually physically impossible, to use up a large remaining allocation within a short time. You would need to claim a large fraction of Carbon's nodes during a relatively short time window, which is unlikely to be possible because jobs from other users will be running as well.

Allocation expiration schedule

Unused allocations will expire according to the following schedule:

  • Your allocation is provided in three equal-sized installments.
  • All installments are active from the beginning.
  • Installments expire in a staggered fashion, currently after 4, 8, and 12 months, respectively. A diagram might illuminate this:
    Proposal                Proposal
     start                 expiration
       |-------|-------|-------|------> Time    
       0       4       8       12      (months)
       |
Installment
       |
    (1)|########....................               KEY
       |                            
    (2)|################............               .  Installment is inactive
       |                            
    (3)|########################....               #  Installment is active
       |

  • Your jobs will, sensibly, be booked against the installments that expire the earliest.
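
The effect of the schedule can be sketched numerically. The helper below (hypothetical name unexpired_allocation) reports how much of an allocation has not yet expired, ignoring any usage, so the result is an upper bound on what remains. It assumes the current 4, 8, and 12 month schedule and treats an installment as expired once its month is reached; the 150000 core-hour total is a made-up example value.

```shell
# Hypothetical helper: core-hours of an allocation that have not yet expired,
# ignoring any usage. Assumptions: three equal installments expiring after
# 4, 8, and 12 months; an installment counts as expired once its month is reached.
unexpired_allocation() {  # usage: unexpired_allocation TOTAL MONTHS_ELAPSED
    awk -v total="$1" -v m="$2" 'BEGIN {
        n = 0
        if (m < 4)  n++
        if (m < 8)  n++
        if (m < 12) n++
        printf "%.2f\n", total * n / 3
    }'
}

# Example with a made-up allocation of 150000 core-hours:
for month in 0 5 9 12; do
    printf 'month %2d: %s\n' "$month" "$(unexpired_allocation 150000 "$month")"
done
```

A balance that drops at the 4 or 8 month mark is therefore most likely the unused remainder of an installment expiring.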

My question is not answered here

See HPC/Support.