September,2024 | |
---|---|
16 Sep 1:00 pm 4:00 pmIntro to Linux Command LineWorking with many of the HPC systems (like those at SciNet) involves using the Linux/UNIX command line. This provides a very powerful interface, but it can be quite daunting for the uninitiated. In this half-day session, you can become initiated with this course which will cover basic commands. It could be a great boon for your productivity.Format: Virtual Virtual | SCMP101 - Sep 2024 |
23 Sep 12:30 pm 2:00 pmIntro to Supercomputing, session 1/3An introduction to basic concepts in High-Performance Computing (HPC). This is intended to be a high-level primer for those largely new to HPC. Topic will include motivation for HPC, available HPC resources, essential issues, problem characteristics as they apply to parallelism and a high-level overview of parallel programming models.Format: Virtual Virtual | HPC101 - Sep 2024 |
25 Sep 12:30 pm 2:00 pmIntro to Supercomputing, session 2/3An introduction to basic concepts in High-Performance Computing (HPC). This is intended to be a high-level primer for those largely new to HPC. Topic will include motivation for HPC, available HPC resources, essential issues, problem characteristics as they apply to parallelism and a high-level overview of parallel programming models.Format: Virtual | HPC101 - Sep 2024 |
27 Sep 12:30 pm 29 Sep 2:00 pmIntro to Supercomputing, session 3/3An introduction to basic concepts in High-Performance Computing (HPC). This is intended to be a high-level primer for those largely new to HPC. Topic will include motivation for HPC, available HPC resources, essential issues, problem characteristics as they apply to parallelism and a high-level overview of parallel programming models.Format: Virtual | HPC101 - Sep 2024 |
October,2024 | |
2 Oct 2:25 pmYour views on this course opensPlease let us know your opinions about the course by completing these questions: | HPC101 - Sep 2024 |
4 Oct 1:00 pm 3:00 pmSecuring File Access Permissions on LinuxDid you know the Linux operating system has built-in tools to control which specific users and groups can access which files and directories? In this session, you will learn what these Linux permissions are, how to use the available tools to control access and sharing, and how to avoid common security pitfalls. | SCMP283 - Oct 2024 |
14 Oct 11:55 pmYour views on this course closesPlease let us know your opinions about the course by completing these questions: | HPC101 - Sep 2024 |
25 Oct 1:00 pm 4:00 pmLinux Shell ScriptingLearn how to write bash scripts, use environment variables, how to control process, and much more. Requires some Linux basic command line experience.Format: Virtual Virtual | SCMP201 - Oct 2024 |
28 Oct 1:00 pm 4:00 pmHPC205 Become an HPC ProLearn how to fully utilize the power of HPC. Discover proven strategies and tools to efficiently scale up from serial jobs to parallel runs across many compute nodes on Niagara. | HPC205 - Oct 2024 |
28 Oct 7:30 pmAttendance quiz closesLearn how to fully utilize the power of HPC. Discover proven strategies and tools to efficiently scale up from serial jobs to parallel runs across many compute nodes on Niagara. | HPC205 - Oct 2024 |
November,2024 | |
5 Nov 1:00 pmAttendance test for November 5 opensWhat is the attendance code? | SCMP142 - Nov 2024 |
5 Nov 1:00 pm 2:00 pmIntro to Programming with PythonNew to programming? Learn the basics of programming using python in eight one-hour sessions over the course of four weeks. Sessions will consist of a mix of lectures and hands-on exercises.Format: In-person. Sessions will be recorded. SciNet Teaching Room | SCMP142 - Nov 2024 |
5 Nov 2:35 pmAttendance test for November 5 closesWhat is the attendance code? | SCMP142 - Nov 2024 |
6 Nov 1:00 pm 4:00 pmGIT Version ControlUsing version control for your scripts, codes, documents, papers, and even data, allows you to track changes, keep backups, and facilitate collaboration. In this workshop, you will learn the basics of version control with the popular distributed version control software GIT. This workshop assumes that students have an understanding of basic Linux shell commands.Format: Virtual Virtual | DAT161 - Nov 2024 |
7 Nov 1:00 pm 2:00 pmIntro to Programming with PythonNew to programming? Learn the basics of programming using python in eight one-hour sessions over the course of four weeks. Sessions will consist of a mix of lectures and hands-on exercises.Format: In-person. Sessions will be recorded. SciNet Training Room | SCMP142 - Nov 2024 |
7 Nov 1:05 pmAttendance quiz lecture 2 opensWhat is the code? | SCMP142 - Nov 2024 |
7 Nov 2:15 pmAttendance quiz lecture 2 closesWhat is the code? | SCMP142 - Nov 2024 |
12 Nov 1:00 pm 2:00 pmIntro to Programming with PythonNew to programming? Learn the basics of programming using python in eight one-hour sessions over the course of four weeks. Sessions will consist of a mix of lectures and hands-on exercises.Format: In-person. Sessions will be recorded. SciNet Teaching Room | SCMP142 - Nov 2024 |
12 Nov 1:00 pmAttendance quiz lecture 3 opensEnter the attendance code. | SCMP142 - Nov 2024 |
12 Nov 2:32 pmAttendance quiz lecture 3 closesEnter the attendance code. | SCMP142 - Nov 2024 |
13 Nov 12:00 pm 1:00 pmSetting Up Compute Infrastructure for Sensitive Data (Compute Ontario Colloquium)We introduce a secure computing enclave at SciNet High-Performance Computing Consortium. Code-named S4H, this environment is already available to groups at the University of Toronto as a pilot project. S4H aims to meet researchers’ needs for hosting and working with sensitive data, which SciNet’s main cluster, Niagara, does not accommodate. In the first part (Yohai), we’ll delve into the technical details. We’ll explain how S4H is different from Niagara in that the data are encrypted at rest and access is hardened, and what that means in practice. We will talk about the difficulties of providing isolation for different research groups on a shared system, and explore the different components that make it possible, such as key management and containerization mechanisms. The second part (Shawn) will focus on adopting the Cybersecurity Maturity Model Certification (CMMC) framework. We’ll describe our journey deciphering the control set’s complexities, developing metadata for organizing remediation efforts, and crafting Plans of Action and Milestones for compliance gaps. Future steps include internal and potentially external assessments to verify compliance, with initiatives like Privacy Impact Assessment and penetration testing, with the eventual goal of being certified for Level 4 data. Virtual | COCO - 13 Nov 2024 |
14 Nov 1:00 pm 2:00 pmIntro to Programming with PythonNew to programming? Learn the basics of programming using python in eight one-hour sessions over the course of four weeks. Sessions will consist of a mix of lectures and hands-on exercises.Format: In-person. Sessions will be recorded. SciNet Training Room | SCMP142 - Nov 2024 |
14 Nov 1:00 pmAttendance quiz lecture 4 opensEnter the attendance code for lecture 4 | SCMP142 - Nov 2024 |
14 Nov 2:30 pmAttendance quiz lecture 4 closesEnter the attendance code for lecture 4 | SCMP142 - Nov 2024 |
25 Nov 1:00 pm 4:00 pmIntro to Linux Command LineWorking with many of the HPC systems (like those at SciNet) involves using the Linux/UNIX command line. This provides a very powerful interface, but it can be quite daunting for the uninitiated. In this half-day session, you can become initiated with this course which will cover basic commands. It could be a great boon for your productivity.Format: Virtual Virtual | SCMP101 - Nov 2024 |
26 Nov 1:00 pm 2:00 pmIntro to Programming with PythonNew to programming? Learn the basics of programming using python in eight one-hour sessions over the course of four weeks. Sessions will consist of a mix of lectures and hands-on exercises.Format: In-person. Sessions will be recorded. SciNet Teaching Room | SCMP142 - Nov 2024 |
27 Nov 12:00 amAssignmen 10 - new is dueDue date: April 6th at midnight (Thursday night).In this assignment we will perform a clustering analysis on some codon data. Codons, for those unfamiliar, are sequences of three nucleotides that form a unit of genetic information. Since there are 4 nucleotide bases (A, C, T, G), there are 4*4*4=64 possible codons. You can read more about codons here. The data for this assignment were originally taken from here, though the original source is here. I have modified the data to simplify it somewhat; the modified data can be found here. The data consists of codon relative-frequency data for 12,788 species of organisms. We will perform a clustering analysis on these data to determine if there are any clusters of significance and which 'Kingdoms' (types of organisms), are most-commonly in those clusters. 0) You must use version control ("git"), as you develop your code. We suggest you start, from the Linux command line, by creating a new directory, e.g. assignment9, cd into that directory and initialize a git repository ("git init") within it, and perform "git add ..., git commit" repeatedly as you add to your code. You will hand in the output of "git log" for your assignment repository as part of the assignment. You must have a significant number of commits representing the modifications, alterations and changes to your code. If your log does not show a significant number of commits with meaningful comments you will lose marks. 1) Create a file named codon_utilities.py containing the following functions. 1a) Write a function which takes a single string argument, the name of the data file. The function should load the data, separate the data into the features (the codon columns) and the targets (the 'Kingdom' column), and return them. The pandas package contains functionality for reading CSV files. The read_csv and drop functions might be useful. You may hard-code the column names involved. 1b) Write a function which, given some input data and a percentage, will perform a principle component analysis. The function should keep only those priciple components which explain the percentage of the data's variance indicated by the second argument. The function should then return the model. 1c) Write a function which, given some input data, will perform 10-fold cross validation on \\(k\\)-Means models, with \\(k\\) values ranging from 1 to 20. The function should generate a plot of cross-validation score versus \\(k\\). 1d) Write a function which takes some input data and a value of \\(k\\). The function will create a \\(k\\)-Means model for the given value of \\(k\\), train it on the data, and return the model. 1e) Write a function which takes as arguments a trained \\(k\\)-means model, some data, and a vector of targets. The function should determine the Kingdom with the highest population in each cluster. The function should print out the name of the Kingdom and the percentage of the population of the cluster that it represents. The collections package's Counter function is useful here. The predict function, associated with the \\(k\\)-means model, is also needed. 2) Create a Python script called codon_analysis.py that will perform the following steps: load the modified data file, linked above, run PCA on the features, keeping those principle components which explain 90% of the data's variance. create a transformed version of the features, using the principle components from the PCA. run 10-fold cross-validation on \\(k\\)-means models, using the transformed data, to determine which value of \\(k\\) would be the ideal number of clusters for this data set. At this point you should examine the figure which is generated by the function. Using your expertise, pick an ideal value of \\(k\\) for this data set. Put an explanation of your choice in the comments of your driver script. Create a new \\(k\\)-Means model, using your choice of ideal value of \\(k\\) and the transformed data. Print out the Kingdom in each cluster which has the largest population. Your script should output something like this, when run from the shell terminal: $ $ python codon_analysis.py Cluster: 0 Kingdom: Vertebrate Cluster: 1 Kingdom: Plant Cluster: 2 Kingdom: Invertebrate Cluster: 3 Kingdom: Bacteria Cluster: 4 Kingdom: Bacteria Cluster: 5 Kingdom: Virus $ Be sure to comment and document your functions. Defensive programming is not needed for this assignment. Submit your codon_utilities.py, codon_analysis.py files, your cross-validation figure, and the output of git log from your assignment repository. Both Python code files must be added and committed frequently to the repository. To capture the output of git log use redirection (git log > git.log, and hand in the git.log file). Assignments will be graded on a 10 point basis. Due date is April 6th 2023 (midnight), with 0.5 penalty point per day off for late submission until the cut-off date of April 13th 2023, at 11:00am. | MSC1090 - Fall 2024 |
28 Nov 1:00 pm 2:00 pmIntro to Programming with PythonNew to programming? Learn the basics of programming using python in eight one-hour sessions over the course of four weeks. Sessions will consist of a mix of lectures and hands-on exercises.Format: In-person. Sessions will be recorded. SciNet Teaching Room | SCMP142 - Nov 2024 |
29 Nov 1:00 pm 4:00 pmIntro to ApptainerContainer computing is gradually changing the way researchers are developing, sharing, and running software applications. Apptainer (formerly called Singularity) is gaining popularity in HPC for its performance, ease of use, portability, and security. In this course, we will explore: what is a container, why use a container, and how to use and create one.Format: Virtual | SCMP161 - Nov 2024 |
December,2024 | |
3 Dec 1:00 pm 2:00 pmIntro to Programming with PythonNew to programming? Learn the basics of programming using python in eight one-hour sessions over the course of four weeks. Sessions will consist of a mix of lectures and hands-on exercises.Format: In-person. Sessions will be recorded. SciNet Teaching Room | SCMP142 - Nov 2024 |