This is a page with a collection of links to materials that I have found helpful for scientific computing.

Document Processing

LyX

GUI editor for LaTex-style documents.

LibreOffice

Never waste money on Microsoft products again.

TexWorks

Full-featured LaTex editor.

JabRef

BibTex references manager.

Tricks:

Data Analysis in Python

Cheat Sheet (Data Structures, Numpy, Pandas)

Anaconda

The most user-friendly Python environment, includes almost all the packages you will need. I work in Spyder, although many people use PyCharm.

Python Codecademy Tutorial

Think Stats

Statistics in IPython.

“Natural Language Processing with Python” book

Recommended Modules:

  • pandas (data management)
  • scikit-learn (machine learning)
  • NLTK (natural language processing)
  • gensim (topic modelling and word embeddings)
  • tensorflow keras (neural nets)
  • huggingface transformers (transformer models)
  • statsmodels (statistics)
  • selenium (web scraping)

Upgrading your code (Mauro Luzzatto)

Relational Databases

PostgreSQL Tutorial

psycopg2

Connect to a Postgres server in Python.

pandas.read_sql

Import SQL data as a pandas DataFrame.

Miscellaneous

FileZilla

FTP and SFTP access.

Unison

Synchronize files across machines.

Git

The industry standard for version control.

tmux

Persistent SSH access with session multiplexing (see here and here).

pdftk

PDF tool for batch split, merge, etc.

ImageMagick

Batch image processing.

VLC Media Player

Open-source video player. OpenSUSE Media Codecs.

Running Linux programs on Windows:

Music

Audacity

  • The best, easiest, open-source, platform-independent audio recording application.

Ardour

  • Open-source digital audio workstation (a steep learning curve!).