7 Jun 1:30 pm - 2:50 pm Virtual | COSS2024 |
10 Jun 9:00 am - 12:00 pm Popular Python libraries for data analytics, such as NumPy, pandas, and scikit-learn, usually work well when the dataset fits into the RAM of a single machine. With larger datasets, working around memory constraints can be a challenge. This course introduces scalable and accelerated data analytics with Dask and RAPIDS. Dask provides a framework and libraries that can handle large datasets on a single multi-core machine or across multiple machines on a cluster. RAPIDS, on the other hand, accelerates data analytics by offloading workloads to GPUs with minimal code changes. Level: Introductory. Length: Two 3-hour sessions (2 days). Format: Lecture + hands-on. Prerequisites: Alliance account; basic Python and Linux command-line experience. (Part of the 2024 Compute Ontario Summer School.) Virtual | COSS2024 |
10 Jun 9:00 am - 12:00 pm This workshop introduces the topic of text mining and its applications. It covers different encoding mechanisms that convert text into numbers that algorithms can handle. It gives an overview of different text mining tasks, including de-identification, sentiment analysis, and document clustering, and shows how they work through examples and live demos. There will also be references to state-of-the-art tools and libraries for conducting various text mining tasks. Level: Introductory. Length: 3 hours. Format: Lecture + hands-on. Prerequisites: Basic Python. (Part of the 2024 Compute Ontario Summer School.) Virtual | COSS2024 |
10 Jun 1:30 pm - 4:30 pm Have you ever tried to run someone else’s code and it just didn’t work? Have you ever been lost interpreting your colleague’s data? This hands-on session will provide researchers with tools and techniques to make their research process more transparent and reusable in remote computing environments. You’ll use platforms like JupyterHub and command-line tools like Bash and Docker in a Linux environment to work through various exercises and examples. In this workshop, you’ll learn about: organizing your file directories; writing readable metadata with README files; automating your workflow with scripts; and capturing and sharing your computational environment. Level: Introductory. Length: 3 hours. Format: Lecture + hands-on. Prerequisites: Initial familiarity with command-line tools and/or a Linux environment may be beneficial but is not mandatory. (Part of the 2024 Compute Ontario Summer School.) Virtual | COSS2024 |