Too many restarts with no progress.

dcdc
Joined: 3 Nov 05
Posts: 1832
Credit: 119,860,059
RAC: 1,391
Message 40960 - Posted: 14 May 2007, 18:06:45 UTC

This machine:
https://boinc.bakerlab.org/rosetta/result.php?resultid=78712074 is giving the following error:

Too many restarts with no progress. Keep application in memory while preempted.

It's a low-end machine (Celeron 900) with 448MB RAM that I built for my girlfriend's little brother. He uses it for games and the internet, so I've set R@H to run when the computer isn't in use (after 3 mins), and it is definitely set to keep the app in memory.

Is the error above because the computer has been restarted 5 times without R@H reaching its next checkpoint? I assume it's honouring the setting to keep rosie in memory?
anders n
Joined: 19 Sep 05
Posts: 403
Credit: 537,991
RAC: 0
Message 40961 - Posted: 14 May 2007, 18:17:02 UTC - in response to Message 40960.  
Last modified: 14 May 2007, 18:17:26 UTC


-snip
Is the error above because the computer has been restarted 5 times without R@H reaching its next checkpoint? I assume it's honouring the setting to keep rosie in memory?


I have seen that happen on one of my slow PCs when running big WUs.

Anders n

Mod.Sense
Volunteer moderator

Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 40967 - Posted: 14 May 2007, 19:35:16 UTC - in response to Message 40960.  

Is the error above because the computer has been restarted 5 times without R@H reaching its next checkpoint? I assume it's honouring the setting to keep rosie in memory?


Yes, exactly. There is no reason to doubt that it is keeping the app in memory. The task was restarted 5 times, either because the PC was turned off, or BOINC was exited, or the WU was preempted by the user or another project and removed from memory. On each restart, Rosetta found itself where it had restarted the last time (i.e. no checkpoint had been reached).

On a slower machine it takes longer, on average, to reach a checkpointable point in a model, so you've got a higher than average chance of seeing this condition, especially if someone is installing new gaming software and doing multiple reboots in succession, or only leaving the machine on for short periods of time.
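
For illustration only, the check described above can be sketched roughly like this. This is not Rosetta's or BOINC's actual code: the names (TaskState, on_restart, MAX_RESTARTS_WITHOUT_PROGRESS) are made up for the example, and the limit of 5 is taken from the restart count mentioned in this thread.

```python
from dataclasses import dataclass

# Assumed limit, based on the "restarted 5 times" figure in this thread.
MAX_RESTARTS_WITHOUT_PROGRESS = 5


@dataclass
class TaskState:
    """Hypothetical bookkeeping for one work unit (not the real data layout)."""
    last_checkpoint: int = 0           # index of the newest checkpoint on disk
    checkpoint_at_last_start: int = 0  # checkpoint the task resumed from last time
    stalled_restarts: int = 0          # consecutive restarts with no new checkpoint


def on_restart(state: TaskState) -> bool:
    """Call on every restart (not the first launch).

    Returns True if the task may keep running, False if it should give up
    with a "too many restarts with no progress" style error.
    """
    if state.last_checkpoint > state.checkpoint_at_last_start:
        # A new checkpoint was written since the previous start: progress.
        state.stalled_restarts = 0
    else:
        # Resuming from the same place as last time: no progress.
        state.stalled_restarts += 1
    state.checkpoint_at_last_start = state.last_checkpoint
    return state.stalled_restarts < MAX_RESTARTS_WITHOUT_PROGRESS
```

In this sketch, a task that is removed from memory and restarted five times in a row without writing a checkpoint is stopped on the fifth restart, while any restart that finds a newer checkpoint resets the counter. A slow machine that is only on for short periods is more likely to hit the limit simply because fewer runs last long enough to reach a checkpoint.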
Rosetta Moderator: Mod.Sense
dcdc
Joined: 3 Nov 05
Posts: 1832
Credit: 119,860,059
RAC: 1,391
Message 40973 - Posted: 14 May 2007, 21:22:39 UTC

Fair enough - I'll have to look at getting some more RAM (and a faster CPU) for it so it can crunch a bit more efficiently, but I've got a few other remotes in line to upgrade first!



