Chapter 2 Structure of the Guidebook and FAQs

2.1 What do we need for lab sessions?

For our labs we are going to use R, RStudio, the basic UNIX command line, and MATLAB. We will also use atom, a basic text editor that’s widely used in programming to view and edit files.

We will use the local cluster called SMAUG (See Figure 1). SMAUG is a computer that the Tabima lab hosts that has 32 processors, more than 14 TB of storage and 128 GB of RAM. Enough to do any kind of analyses.

2.2 Why are we using these programs?

While most of our stuff will be executed online, I thought it would be great that we learn a bit about programming and reproducible science.

R and RStudio will allow us to do that. With R we can create code that can be reproducible and we can use for future analisis. With RStudio we will have an easy usable user interface to create notebooks like this one!

MATLAB will also give us the opportunity to learn some programming skills and get some exposure to another program widely used by computational and mathematical biologists.

Atom will help us read files we wouldn’t be able to read otherwise. As we go forward with class we will see some examples of these files.

2.3 Do we need to know how to program?

No. We will learn how to do some basic programming and how to use the command line (i.e. how to use basic commands, how to create folders, execute programs and create loops).

Being recursive and using the resources we can is very important. Sometimes things won’t work as expected and we MUST find an alternative. This is one of those cases.