Computing

This is a page with a collection of links to materials that I have found helpful for scientific computing.

Document Processing

LyX

GUI editor for LaTex-style documents.

LibreOffice

Never waste money on Microsoft products again.

TexWorks

Full-featured LaTex editor.

JabRef

BibTex references manager.

Tricks:

Data Analysis in Python

Anaconda

The most user-friendly Python environment, includes almost all the packages you will need. I work in Spyder, although many people use PyCharm.

Python Codecademy Tutorial

Think Stats

Statistics in IPython.

“Natural Language Processing with Python” book

Recommended Modules:

  • pandas (data management)
  • scikit-learn (machine learning)
  • NLTK (natural language processing)
  • gensim (topic modelling and word embeddings)
  • statsmodels (statistics)
  • robobrowser (web scraping)

Relational Databases

PostgreSQL Tutorial

psycopg2

Connect to a Postgres server in Python.

pandas.read_sql

Import SQL data as a pandas DataFrame.

Miscellaneous

FileZilla

FTP and SFTP access.

Unison

Synchronize files across machines.

Git

The industry standard for version control.

tmux

Persistent SSH access with session multiplexing (see here and here).

pdftk

PDF tool for batch split, merge, etc.

ImageMagick

Batch image processing.

VLC Media Player

Open-source video player. OpenSUSE Media Codecs.

Running Linux programs on Windows:

Music

Audacity

  • The best, easiest, open-source, platform-independent audio recording application.

Ardour

  • Open-source digital audio workstation (a steep learning curve!).

Law and Public Policy Lab (out of date)

OpenSUSE Appliance

Full-stack operating system for social science research (setup shell commands).

To run as a virtual machine, use VirtualBox.

Text Analysis Platform

Integrated code base for text analysis (see Wiki page).