Open tmozgach opened 6 years ago
TO-DO: try parallelized LDA model https://radimrehurek.com/gensim/models/ldamulticore.html#module-gensim.models.ldamulticore
Running on Cedar (first trial, in /home/tmozgach/scratch/TM):
TM_job.sh
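#!/bin/bash
# Wall-clock time limit (HH:MM:SS)
#SBATCH --time=04:00:00
# Allocation account to charge the job to
#SBATCH --account=def-emodata
# Memory per node, in megabytes
#SBATCH --mem=2000
python ./TopicModeling.py
echo 'I finished'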
Submitted with the command
sbatch TM_job.sh
on TPostRaw.csv (titles and posts only, NO comments).
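Once submitted, the job can be followed with the standard Slurm tools. A minimal sketch, assuming it is run from the directory that contains TM_job.sh; the `command -v` guard and the SUBMITTED variable are additions for illustration, so the snippet exits cleanly on a machine without Slurm:

```shell
#!/bin/bash
# Submit the job and list your queued/running jobs.
# sbatch and squeue are standard Slurm commands; the guard below is an
# illustrative addition for machines without Slurm (e.g. a laptop).
JOB_SCRIPT="TM_job.sh"

if command -v sbatch >/dev/null 2>&1; then
    sbatch "$JOB_SCRIPT"      # prints "Submitted batch job <id>"
    squeue -u "$USER"         # job states: PD = pending, R = running
    SUBMITTED=yes
else
    echo "sbatch not found: run this on a Cedar login node" >&2
    SUBMITTED=no
fi
```

By default Slurm writes the job's output (including the final 'I finished') to slurm-<jobid>.out in the directory the job was submitted from.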
Cedar is a heterogeneous cluster suitable for a variety of workloads.
Registration link: http://www.rcg.sfu.ca/hpc/cedar/register-cc
Registration tutorial: https://www.youtube.com/watch?v=EtFUvBYaaZE&list=PLlIsA3IgWjnweFiHlOjBmsF2NTeJ6f5rT
Log in on Mac or Linux:
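The original login command is not preserved here. A minimal sketch of what it might look like, assuming a placeholder username (your_username is hypothetical) and assuming cedar.computecanada.ca as the login host; check the Cedar documentation for the current address:

```shell
# Hypothetical account name: replace with your own Compute Canada username.
CEDAR_USER="your_username"
# Assumed login host; verify it against the Cedar documentation.
CEDAR_HOST="cedar.computecanada.ca"

# Build the login command and print it so it can be checked first.
LOGIN_CMD="ssh ${CEDAR_USER}@${CEDAR_HOST}"
echo "$LOGIN_CMD"

# Uncomment to actually connect (you will be asked to authenticate):
# ssh "${CEDAR_USER}@${CEDAR_HOST}"
```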
How to deal with software:
https://docs.computecanada.ca/wiki/Available_software
Transfer files from laptop to Cedar server:
Transfer files from Cedar to laptop:
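The transfer commands themselves are missing from this copy. A sketch of both directions with scp, run from the laptop, assuming a placeholder username, an assumed login host, and a hypothetical output file name (results.csv); only TPostRaw.csv and the scratch/TM layout come from the notes above:

```shell
# Hypothetical account name and assumed host: adjust both to your account.
CEDAR_USER="your_username"
CEDAR_HOST="cedar.computecanada.ca"
# Remote working directory, mirroring the scratch/TM layout of the first trial.
REMOTE_DIR="/home/${CEDAR_USER}/scratch/TM"

# Laptop -> Cedar: upload the input data.
UPLOAD_CMD="scp TPostRaw.csv ${CEDAR_USER}@${CEDAR_HOST}:${REMOTE_DIR}/"
# Cedar -> laptop: download a result file (results.csv is a hypothetical name).
DOWNLOAD_CMD="scp ${CEDAR_USER}@${CEDAR_HOST}:${REMOTE_DIR}/results.csv ."

# Print the commands for checking; run them by hand once they look right.
echo "$UPLOAD_CMD"
echo "$DOWNLOAD_CMD"
```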
OR
How to set up Globus: https://docs.computecanada.ca/wiki/Globus
Full webinar videos: https://docs.computecanada.ca/wiki/Getting_Started_with_the_new_National_Systems
Installing Python modules on Cedar: https://docs.computecanada.ca/wiki/Python
Main setup (once):
Every time you log in:
Install a Python module:
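The commands for these three steps are not preserved here. The sketch below wraps one plausible version of each step in a shell function, so that defining them has no side effects. The function names and the ~/ENV location are hypothetical, and `module load python` assumes the default Python module is acceptable (list versions with `module avail python`):

```shell
# Hypothetical location for the virtual environment.
ENV_DIR="$HOME/ENV"

# Run once: create the virtual environment.
cedar_setup_once() {
    module load python                    # assumption: default Python module
    virtualenv --no-download "$ENV_DIR"   # do not fetch pip/setuptools from the internet
}

# Run after every login: activate the environment in the current shell.
cedar_activate() {
    module load python
    . "$ENV_DIR/bin/activate"
}

# Install one or more packages into the active environment.
cedar_install() {
    pip install "$@"
}
```

Typical use would be cedar_setup_once in the first session, then cedar_activate followed by, for example, cedar_install gensim in later ones. The functions need to be defined in the login shell itself (for instance by sourcing a file that contains them), because activation changes the current shell's environment.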
You MUST install the following Python packages on Cedar in YOUR virtual environment: