Updates from July, 2007 Hide threads | Keyboard Shortcuts

  • BOSCing 

    Jason Stajich 6:26 am on July 19, 2007 Permalink | Reply

    (Picture from Todd ’s iPhone during BOSC in Vienna today).

    I give a BioPerl update talk today and looking forward to catching up with friends over dinner and beers.

    Other people like Greg are blogging about the conference and we’ll hopefully put up some pictures at the OBF gallery as soon as they are available.

     
    • Chris Fields 11:21 pm on July 19, 2007 Permalink | Reply

      I think me, Sendu, and the rest of the BioPerl devs owe you a beer (or twenty). Thanks for doing the talk; hopefully if everything works out I’ll be there next go around!

  • Slideshare 

    Jason Stajich 5:11 pm on April 22, 2007 Permalink | Reply

    Trying out Slideshare, there is apparently now wordpress plugin so that this can be embedded.

     
  • group photo 

    Jason Stajich 3:28 pm on December 15, 2006 Permalink | Reply

    biggroup2.jpg

    The phylohackers….

     
  • Yah! Teragrid 

    Jason Stajich 12:21 am on November 15, 2006 Permalink | Reply

    I’ve been able to make the transition from the 1000 node Duke cluster to a smaller one here at Berkeley using Teragrid. What’s great about Teragrid is there are heterogeneous compute clusters with big SMP machines and 5000 node blade clusters. So I can run big memory apps or long running CPU intensive jobs without having to really to mess around too much.

    Not to say that it is all easy. Each system has its own filesystem and in some cases, own queing system. So you have to be able to deal with PBS, LSF, and I think SGE. Since I’m a cluster scavenger anyways I guess you deal with what you can get.

    I haven’t quite figured out how to deal with globus for running these types of jobs, mostly because lots of the analysis requires coordinating too many large datafiles and it is easier to stage them on a particular sites cluster.

    I don’t think know if they are ready for large scale informatics though (or at least in me distributing jobs across whole cluster). I can only have 40 jobs in the queue on the TACC system for example so if you need to run 10-20K you have to chunk things a little differently.

    All in all I am pleased, we’ll see what happens if I start trying to run annotation pipelines again since the CPU time is allocated and I’ve already eaten up 1/3 on my first foray here. There are larger allocations to apply for so maybe that will be the way to go.

    Philosophically I am not sure if general purpose clusters are the way to go for all of bioinformatic computing. It seems like there are always a variety of types of jobs: independent and parallel jobs, jobs with dependancies, large memory jobs, long running jobs, many many short running jobs…

     
  • Visualizing biology 

    Jason Stajich 4:08 am on September 16, 2006 Permalink | Reply

    Some cool stuff while at Genome Informatics conference in Hinxton.

     
c
compose new post
j
next post/next comment
k
previous post/previous comment
r
reply
e
edit
o
show/hide comments
t
go to top
l
go to login
h
show/hide help
esc
cancel