bcharlier / HMMA238

This is the public repo for the course HMMA238 'Software Development'
10 stars 48 forks source link

HMMA 238

(Almost) everything you need to know as an applied mathematician / statistician concerning coding and system administration.

Teachers

with the help of some students including:

Web page

https://github.com/bcharlier/HMMA238

https://github.com/HMMA238-2021/

Prerequisite

Students are expected to know basic notions of probabilities, optimization, linear algebra and statistics for this course. Some rudiments on coding is also expected (if, for, while, functions) but not mandatory.

Course description

This course focuses on discovering good coding practices (the language used being Python, but some element of bash and git will also be useful) for professional coding. A special focus on data processing and visualization will be at the heart of the course. We will mostly focus on basic programming concepts, as well as on discovering the Python scientific libraries, including numpy, scipy, pandas, matplotlib, seaborn. Beyond pandas ninja skills, we will also introduce modern practices for coders : (unitary) tests, version control, documentation generation, etc.

(Tentative) Course schedule

  1. BC : (20/01) Introduction to linux essentials and command line tools: bash,

  2. BC : (21/01) Introduction to linux essentials and command line tools: regexp, grep, find, rename,

  3. BC : (27/01) IDE: VScode, Python virtual env: Anaconda, Python virtual environment, Git: a first introduction, github, ssh key creation, various git commands, conflict, pull request; see also Bonus/

  4. JS : (28/01) Coding : algorithms, modules, basic types, functions, loops coding : list, dictionary, tuples, if statement and loops, exceptions

  5. BC : (03/02) hands on git

  6. JS : (04/02) numpy : basics on matrices (arrays), slicing, simple linear algebra, masking; matplotlib: first plots

  7. BC : (10/02) Some git again,

  8. JS : (11/02) numpy : casting, concatenation;

  9. JS : (17/02) numpy / matplotlib: imshow, meshgrid, copy;

  10. JS : (18/02) scipy: EDO, Interpolation, Optimize

  11. BC : (03/03) classes (__init__, __call__, etc...), operator overloading, files handling, Create a Python Module

  12. JS : (04/03) scipy: Images/channel, Pandas: first steps / missing data

  13. BC : (10/03) Create a Python Module

  14. BC : (17/03) unit tests

  15. JS : (18/03) Sparse matrices,

  16. JS : (31/03) graphs and memory

  17. JS : (01/04) Numba, parallelism

  18. BC : (07/04) Documentation with Sphinx

  19. JS : (08/04) Statsmodels

  20. JS-BC : (19/04) Oral examination

  21. JS-BC : (20/04) Oral examination

Grading

Challenge (25% of the final grade)

Tests (15% of the final grade)

Three short tests of 15 min each (on Moodle). This will be a personal work.

Project (60% of the final grade)

Warning: the precise details of the projects might evolve before the allocation phase, and a precise grid will be given in the project section.

Warning: the project repository must show a balanced contribution between group members and intra-group grades variation could be made to reflect issues on the intra-group workload balance.

Bonus

1 supplementary point on the final grade of the course can be obtained for contributions improving the course material (practicals, Readme, etc.). See the Bonus section for more details on how to proceed.

Books and other resources

The resources for the course are available on the present github repository. Additional elementary elements (in French) on Python are available in the course HLMA310 and the associated lectures notes IntroPython.pdf.

Additional resources

Oldies (for jupyter notebook extensions)

Some useful extensions:

conda install -c conda-forge jupyter_contrib_nbextensions
conda install -c conda-forge nbstripout