Useful UNIX tools

Updated and presented by: Tracy Teal

SSH

What is ssh?

ssh stands for ‘secure shell’. It’s a way of connecting to another computer without sending your passwords in the clear. telnet, for example, is like ssh but your password is not encrypted when it’s sent.

We’re going to ssh into a Amazon instance that was set up for this class.

If you want more information on Amazon instances, Titus has a nice reference here: http://ged.msu.edu/angus/tutorials/renting-a-computer-from-amazon.html

You can log in to this machine by typing

The password is the name of this institute.

Now you’re on the Amazon instance!

Here we see the files we saw in the 1-Shell lesson and we can do all the things here to them that we could on our systems yesterday. Run through a few of the shell commands we did yesterday.

Screen

Now say you’re on this instance and you want to run a program that’s going to take a long time, and you want to be able to log out of the instance and still have it run. Here is where ‘screen’ is your friend.

Type

screen

You’ll get the prompt back again and you can do just what you normally do. Type ‘ls’ for instance. Now, though, type ‘exit’. You’re still at a prompt. All you did was exit out of screen.

Start screen again, by typing ‘screen’

Do an ‘ls’ again. Now we want to detach from the screen, so that the process could keep running while we log off of Amazon or any remote machine. To detach

Ctrl-A Ctrl-D

If you want to reattach to that screen, type

screen -r

If you have multiple screens running, it will tell you that and you have to pick

screen -r screen-number

That’s all there is to screen.

A test example

Now let’s try a test example of logging on to a remote machine, running screen, running a blast at the command line, detaching from the screen and then coming back to check on the results.

Follow the instructions above for loggin in and starting screen

Now to run command line blast

You can download command line blast from NCBI. It’s called ‘blastall’

If you type blastall - You get all the options

This is a standard blastall command

blastall -e 1e-05 -p tblastx -d test_fasta -i seqs.fa -o seqs.blast

-e is the e-value cutoff you want to use. Any matches higher than that will not be returned -p is the program - tblastx, blastx, blastn or tblastn -d is the database -i is the input file -o is the output file -m is the output type you want

If you’re parsing the output, then you want to use -m 8. It outputs a tab delimited format that’s easy to look through The default shows you all the alignments

If you do use -m 8 this is the information in each column

# Query id # Subject id # % identity # alignment length # number of mismatches # number of gap openings # position of query start # position of query end # position of subject start # position of subject end # e-value of a hit # bit score of a hit

So, we’ll run blastall on a test dataset

There are some test fasta files in the ‘fasta’ directory

Go in to that directory. You can use any of these files as the input fasta file.

The blast database is: /home/swc/blastdb/nirK_ref_Rh

blastall -e 1e-05 -p tblastx -d /home/swc/blastdb/nirK_ref_Rh -i oneseq.fasta -o seqs.blast

comments powered by Disqus

Table Of Contents

This Page




Edit this document!

This file can be edited directly through the Web. Anyone can update and fix errors in this document with few clicks -- no downloads needed.

  1. Go to Useful UNIX tools on GitHub.
  2. Edit files using GitHub's text editor in your web browser (see the 'Edit' tab on the top right of the file)
  3. Fill in the Commit message text box at the bottom of the page describing why you made the changes. Press the Propose file change button next to it when done.
  4. Then click Send a pull request.
  5. Your changes are now queued for review under the project's Pull requests tab on GitHub!

For an introduction to the documentation format please see the reST primer.