minirosetta 2.05

Message boards : Number crunching : minirosetta 2.05

To post messages, you must log in.

1 · 2 · 3 · 4 . . . 10 · Next

AuthorMessage
Profile David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 1 Jul 05
Posts: 1018
Credit: 4,334,829
RAC: 0
Message 64951 - Posted: 13 Jan 2010, 18:11:01 UTC

This app update includes a fix for checkpointing.

Please report issues and bugs here!

thanks,

DK
ID: 64951 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Sarel

Send message
Joined: 11 May 06
Posts: 51
Credit: 81,712
RAC: 0
Message 64953 - Posted: 13 Jan 2010, 19:21:01 UTC

Hi,

I'll be resubmitting the *gbnnotyr* protein design trajectories to boinc over the next few hours. The tests I ran on ralph showed that the checkpointing issue is resolved. To make sure that there are no other issues, I will submit these trajectories 'slowly' starting with a modest sized batch, and according to the responses I get on the thread I will increase the number of work units over the next few days. Please keep me posted about these problems. Your reports have been invaluable in tracking this problem down!

Sarel.
ID: 64953 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
hellotheworld

Send message
Joined: 27 Feb 08
Posts: 3
Credit: 729,194
RAC: 0
Message 64959 - Posted: 14 Jan 2010, 9:03:30 UTC - in response to Message 64951.  

This app update includes a fix for checkpointing.

Please report issues and bugs here!

thanks,

DK


Hi,

I have a strange graphic I wanted to show you... I *think* there *might* be a problem...

Please go to see this sreen shoot :

http://www.flickr.com/photos/37828392@N08/4273
(Capitain Flam is my account on Flickr)


Possible bug for the application BOINC / ROSETTA, because the protein is *completely* folded, in a tiny meat ball ;-)

I hope this is NOT a bug, or even, I hope it will help you to solve it ;)
ID: 64959 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
hellotheworld

Send message
Joined: 27 Feb 08
Posts: 3
Credit: 729,194
RAC: 0
Message 64960 - Posted: 14 Jan 2010, 9:23:40 UTC - in response to Message 64959.  

This app update includes a fix for checkpointing.

Please report issues and bugs here!

thanks,

DK


Hi,

I have a strange graphic I wanted to show you... I *think* there *might* be a problem...

Please go to see this screen shoot :

http://www.flickr.com/photos/37828392@N08/4273

(Capitain Flam is my account on Flickr)


Possible bug for the application BOINC / ROSETTA, because the protein is *completely* folded, in a tiny meat ball ;-)

I hope this is NOT a bug, or even, I hope it will help you to solve it ;)


Sorry, I didn't cut'n'paste well the link... Here it is !

http://www.flickr.com/photos/37828392@N08/4273113531/

Sorry sorry sorry :-|
ID: 64960 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Admin

Send message
Joined: 13 Apr 07
Posts: 42
Credit: 260,782
RAC: 0
Message 64967 - Posted: 14 Jan 2010, 14:35:05 UTC

Bad news guys just woke up today and my homopt_cstmc WU is stuck @ 40% using no CPU time. Although 3-4 other different named WU's have gone through and been totally fine. Just thought id let you know.
ID: 64967 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 64969 - Posted: 14 Jan 2010, 16:40:36 UTC

Admin, please double check the application version those are running under. (it is shown in the tasks tab of the advanced view under the application column)
Rosetta Moderator: Mod.Sense
ID: 64969 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
hellotheworld

Send message
Joined: 27 Feb 08
Posts: 3
Credit: 729,194
RAC: 0
Message 64971 - Posted: 14 Jan 2010, 16:58:37 UTC - in response to Message 64969.  

Admin, please double check the application version those are running under. (it is shown in the tasks tab of the advanced view under the application column)


About
http://www.flickr.com/photos/37828392@N08/4273113531/

I confirm running under :
Rosetta mini 2.03
ID: 64971 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Admin

Send message
Joined: 13 Apr 07
Posts: 42
Credit: 260,782
RAC: 0
Message 64972 - Posted: 14 Jan 2010, 17:05:05 UTC

I can 100% confirm i am/was running the new version mini rosetta 2.05 when i got the stuck homopt WU. Heres the WU link: https://boinc.bakerlab.org/rosetta/workunit.php?wuid=282419440. A wingman seems to have also had a compute error, but I can confirm i was running the updated 2.05 client.
ID: 64972 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Rabinovitch
Avatar

Send message
Joined: 28 Apr 07
Posts: 28
Credit: 5,439,728
RAC: 0
Message 64974 - Posted: 14 Jan 2010, 17:10:04 UTC

New app working well. And it seems that now the WU need less RAM (about 100 MB per WU). Is it true? If it is, then may be this is a step to rosetta's GPU client? :-)
ID: 64974 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Admin

Send message
Joined: 13 Apr 07
Posts: 42
Credit: 260,782
RAC: 0
Message 64975 - Posted: 14 Jan 2010, 17:14:42 UTC
Last modified: 14 Jan 2010, 17:19:18 UTC

Although I didnt grab a screenshot the task details of the work unit show "application version 2.05" You can check it out at https://boinc.bakerlab.org/rosetta/result.php?resultid=310562856. I wish i could give you guys more information, anything else i can do to help you guys solve this issue? All other work so far has gone through fine, but upon further investigation the common factor is windows 7. I have a boinc_filtered loopbuild_threading running now at 33% which gave me problems on 2.03, so i will see how it goes on 2.05 and give an update.
ID: 64975 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Oxfez

Send message
Joined: 28 May 07
Posts: 1
Credit: 161,558
RAC: 0
Message 64977 - Posted: 14 Jan 2010, 19:43:55 UTC

One of my tasks has "meatballed" too:

lr5_no_pro_close_no_dun_A_rlbd_1rnb_SAVE_ALL_OUT_IGNORE_THE_REST_DECOY_16701_583_0

Running new 2.05

According to the time to completion, it's going to be a long old process too.
ID: 64977 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Sarel

Send message
Joined: 11 May 06
Posts: 51
Credit: 81,712
RAC: 0
Message 64979 - Posted: 14 Jan 2010, 20:47:33 UTC - in response to Message 64974.  

Thanks! If these were the *gbn* runs, then they have a low-memory step which is memory efficient, but then they /might/ go on to a memory intensive step requiring 300-500Mb...

New app working well. And it seems that now the WU need less RAM (about 100 MB per WU). Is it true? If it is, then may be this is a step to rosetta's GPU client? :-)


ID: 64979 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Evan

Send message
Joined: 23 Dec 05
Posts: 268
Credit: 402,585
RAC: 0
Message 64984 - Posted: 15 Jan 2010, 0:40:43 UTC - in response to Message 64975.  

Although I didnt grab a screenshot the task details of the work unit show "application version 2.05" You can check it out at https://boinc.bakerlab.org/rosetta/result.php?resultid=310562856. I wish i could give you guys more information, anything else i can do to help you guys solve this issue? All other work so far has gone through fine, but upon further investigation the common factor is windows 7. I have a boinc_filtered loopbuild_threading running now at 33% which gave me problems on 2.03, so i will see how it goes on 2.05 and give an update.


I wouldn't worry about it. A number of these have failed. I have just sent in two that failed on their second run.

ID: 64984 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Admin

Send message
Joined: 13 Apr 07
Posts: 42
Credit: 260,782
RAC: 0
Message 64985 - Posted: 15 Jan 2010, 1:08:25 UTC

While The boinc_filtered WU went through fine, i have another that has stalled: opttest2.2d4f..... just thought id give an update, it froze at 18.046%. Other than that 2.05 seems stable although sometimes the graphics crash when i try to look at them.
ID: 64985 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Admin

Send message
Joined: 13 Apr 07
Posts: 42
Credit: 260,782
RAC: 0
Message 64986 - Posted: 15 Jan 2010, 3:42:21 UTC
Last modified: 15 Jan 2010, 3:43:42 UTC

Just had to shut down boinc, which i did properly to run a few programs quickly. Seems both Wu's the computer was working on started from model 0 when the client restarted. Both units were between 10-15 models done for being around 20% complete which they are currently (20% complete and now working on model 1). Did the units really just start over from 0 and erase all the previous work? Is this another issue we are tracking? Just trying to be helpful!
ID: 64986 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1233
Credit: 14,324,975
RAC: 3,637
Message 64987 - Posted: 15 Jan 2010, 3:55:59 UTC

In another thread, I've seen something about workunits using one of the new features not having working checkpointing while that feature is running. Checkpointing still works for workunits that don't use that feature.
ID: 64987 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Admin

Send message
Joined: 13 Apr 07
Posts: 42
Credit: 260,782
RAC: 0
Message 64988 - Posted: 15 Jan 2010, 4:02:26 UTC

I was reading the 2.03 thread and saw something about the checkpoint issue, which i saw with myself just now thats why I thought I would point it out. Your saying everything is fine even though the model says its starting from 1 again correct? Thanks for the help!
ID: 64988 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mad_Max

Send message
Joined: 31 Dec 09
Posts: 209
Credit: 26,262,530
RAC: 19,111
Message 64993 - Posted: 15 Jan 2010, 15:12:43 UTC - in response to Message 64974.  

New app working well. And it seems that now the WU need less RAM (about 100 MB per WU). Is it true? If it is, then may be this is a step to rosetta's GPU client? :-)


I too notice that version 2.05 uses less RAM, and not only on tasks *gbn*. Somewhere 200-250 MB instead of 300-350 in version 2.03.
Is it one of "and other minor updates" about which is written in "Version Release Log"?
If so it seems to me not absolutely "minor" :)
ID: 64993 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mad_Max

Send message
Joined: 31 Dec 09
Posts: 209
Credit: 26,262,530
RAC: 19,111
Message 64994 - Posted: 15 Jan 2010, 16:05:29 UTC

I noticed such thing in the new version (though it can feature of the concrete WU - this type of WU in version 2.03 did not come across to me). At model calculation at first steps go very fast, for example 36000 steps have been calculated all for 6 minutes after that calculation has gone very slowly and following 10 steps have occupied more than 10 minutes.
And it is conceived?
Task example: job_boinc_1bm8__broker_random_pairings_from_psipred_16 906_1305_1
ID: 64994 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 64995 - Posted: 15 Jan 2010, 16:45:34 UTC

Please don't presume that the information from the Project Team is an inaccurate description and that your memory observations are a new and permanent condition for all to enjoy going forward. As Sarel points out, they introduced a new type of work unit which has a new low-memory phase to execution. And so you are only going to see the lower memory usage when that specific type of task is being worked on. And this new type of work unit was introduced in prior versions, so the actual delta to v2.05 is small. Since this new type of work is a current area of review, you may see a high concentration of this type of work for a period of time. But it doesn't mean we can presume more then was stated.
Rosetta Moderator: Mod.Sense
ID: 64995 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
1 · 2 · 3 · 4 . . . 10 · Next

Message boards : Number crunching : minirosetta 2.05



©2024 University of Washington
https://www.bakerlab.org