SERVER PROBLEMS.

Message boards : Number crunching : SERVER PROBLEMS.

Sid Celery

Joined: 11 Feb 08
Posts: 2130
Credit: 41,424,155
RAC: 16,102
Message 63188 - Posted: 7 Sep 2009, 12:53:18 UTC - in response to Message 63182.  

Is there a problem? None of my rigs are getting any work this morning, I see.

I'm not entirely sure. I got one or two similar messages over the weekend, but WUs seemed to come through after a few tries and hardly seemed to impact my buffer at all. It's not happening here at all now. Problem solved?
ID: 63188 · Rating: 0
.clair.
Joined: 2 Jan 07
Posts: 274
Credit: 26,399,595
RAC: 0
Message 63197 - Posted: 7 Sep 2009, 21:31:21 UTC

There are not many WUs to go around at the moment,
but that can easily change.

Database status
State            Approximate #results
Ready to send    6
In progress      769,856
ID: 63197 · Rating: 0
P . P . L .
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 63231 - Posted: 9 Sep 2009, 21:37:46 UTC
Last modified: 9 Sep 2009, 21:52:59 UTC

The validator looks like it has all but stopped. I returned this one yesterday,

9 Sep 2009 6:28:39 UTC, and others are having the same problem.

See this thread.

https://boinc.bakerlab.org/rosetta/forum_thread.php?id=5054

Edit// Just to add: the TeraFLOPS estimate is 32.701, down from the high 90s.
ID: 63231 · Rating: 0
P . P . L .
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 63287 - Posted: 11 Sep 2009, 21:56:41 UTC

It seems things have gone from bad to worse.

A lot of these are going round: "Workunit error - check skipped"

The TeraFLOPS estimate:17.....

I need a holiday!

ID: 63287 · Rating: 0
Speedy
Joined: 25 Sep 05
Posts: 163
Credit: 808,337
RAC: 3
Message 63289 - Posted: 11 Sep 2009, 23:22:45 UTC - in response to Message 63287.  

It seems things have gone from bad to worse.

The TeraFLOPS estimate:17.....


Could these messages from the front page have something to do with it?
Sep 10, 2009
The validator and scheduler servers are currently slowly processing a large work unit. We have reprioritized the WU after finding that it is causing server problems. However, it will take a while for the existing jobs to clean out. Meanwhile, server lags are expected.

Sep 11, 2009
Based on the current rate of data crunching, the server lag problem should be alleviated through this weekend.

TeraFLOPS estimate:15.881
Have a crunching good day!!
ID: 63289 · Rating: 0
rochester new york
Joined: 2 Jul 06
Posts: 2842
Credit: 2,020,043
RAC: 0
Message 63298 - Posted: 12 Sep 2009, 1:45:38 UTC - in response to Message 63289.  

TeraFLOPS reads 22 now (9/11/09, 2144 Eastern Daylight Time). Hopefully that means good news.


ID: 63298 · Rating: 0
Chilean
Joined: 16 Oct 05
Posts: 711
Credit: 26,694,507
RAC: 0
Message 63319 - Posted: 13 Sep 2009, 21:15:35 UTC

The TeraFLOPS estimate just hit an all-time low.

TeraFLOPS estimate: 8.514
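
To put a number on the slide, here is a small Python sketch that works out the percentage drop between the exact TeraFLOPS figures posted in this thread (32.701, 15.881 and 8.514). The dates are the posting dates of those messages, not official measurement times, and `percent_drop` is just an illustrative helper name.

```python
def percent_drop(earlier, later):
    """Percentage decrease from an earlier reading to a later one."""
    return (earlier - later) / earlier * 100.0


# (posting date, TeraFLOPS estimate) pairs quoted in this thread.
READINGS = [
    ("9 Sep 2009", 32.701),
    ("11 Sep 2009", 15.881),
    ("13 Sep 2009", 8.514),
]

if __name__ == "__main__":
    # Compare each reading with the one posted before it.
    for (d0, v0), (d1, v1) in zip(READINGS, READINGS[1:]):
        print(f"{d0} -> {d1}: down {percent_drop(v0, v1):.1f}%")
    # Overall change from the first to the last reading.
    first, last = READINGS[0][1], READINGS[-1][1]
    print(f"overall: down {percent_drop(first, last):.1f}%")
```

The "high 90s" figure mentioned earlier is left out because no exact value was posted.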
ID: 63319 · Rating: 0
AMD_is_logical
Joined: 20 Dec 05
Posts: 299
Credit: 31,460,681
RAC: 0
Message 63323 - Posted: 14 Sep 2009, 0:26:16 UTC

I got credit for a WU today:

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=253191930

Note that this WU is an old one that was never returned the first time, and was eventually resent to me. I had some similar WUs yesterday.

All of the many current WUs that I've returned are still pending.

So somehow all the current WUs are different in a way that the Validator can't handle, but old WUs validate just fine.
ID: 63323 · Rating: 0
P . P . L .
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 63325 - Posted: 14 Sep 2009, 5:43:04 UTC
Last modified: 14 Sep 2009, 5:53:37 UTC

There's no light at the end of the tunnel yet.

rah_assimilator1__bk1__Not running

As of 14 Sep 2009 5:30:19 UTC

TeraFLOPS estimate: 9.546

I personally haven't had any work validated for days.

EDIT// Just a suggestion but you might have to stop sending out new tasks to let the validator catch up, other projects have done that.//
ID: 63325 · Rating: 0
AMD_is_logical
Joined: 20 Dec 05
Posts: 299
Credit: 31,460,681
RAC: 0
Message 63330 - Posted: 14 Sep 2009, 12:59:28 UTC - in response to Message 63325.  

EDIT// Just a suggestion but you might have to stop sending out new tasks to let the validator catch up, other projects have done that.//

That wouldn't do any good. The Validator can't validate current tasks no matter how much time it is given. When I send back an old WU that the Validator can handle, it is promptly validated.

Example: https://boinc.bakerlab.org/rosetta/workunit.php?wuid=253620869

ID: 63330 · Rating: 0
Gray Handcock
Joined: 26 Sep 05
Posts: 20
Credit: 2,018,415
RAC: 0
Message 63532 - Posted: 30 Sep 2009, 22:13:07 UTC

Hi - not sure if this is related to the above, but I am getting this error:

Wed 30 Sep 2009 23:47:56 SAST|rosetta@home|Sending scheduler request: To fetch work. Requesting 276065 seconds of work, reporting 0 completed tasks
Wed 30 Sep 2009 23:49:03 SAST|rosetta@home|Scheduler request failed: HTTP internal server error
Wed 30 Sep 2009 23:54:41 SAST|rosetta@home|Sending scheduler request: Requested by user. Requesting 276563 seconds of work, reporting 0 completed tasks
Wed 30 Sep 2009 23:55:47 SAST|rosetta@home|Scheduler request failed: HTTP internal server error
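
For anyone who wants to tally these from their own client log, here is a minimal Python sketch. It assumes the pipe-separated `timestamp|project|message` layout of the lines above and only recognises the three message prefixes seen in this thread; `classify` and `summarize` are made-up names for illustration, not part of BOINC.

```python
import re
from collections import Counter

# One log line looks like "<timestamp>|<project>|<message>".
LINE_RE = re.compile(r"^(?P<when>[^|]+)\|(?P<project>[^|]+)\|(?P<message>.+)$")


def classify(message):
    """Give a coarse label to a scheduler-related message."""
    if message.startswith("Sending scheduler request"):
        return "request"
    if message.startswith("Scheduler request failed"):
        return "failed"
    if message.startswith("Scheduler request succeeded"):
        return "succeeded"
    return "other"


def summarize(lines):
    """Count how many lines fall under each label; skip unparseable lines."""
    counts = Counter()
    for line in lines:
        match = LINE_RE.match(line.strip())
        if match is None:
            continue
        counts[classify(match.group("message"))] += 1
    return counts


# The four lines quoted in this post.
SAMPLE = [
    "Wed 30 Sep 2009 23:47:56 SAST|rosetta@home|Sending scheduler request: To fetch work. Requesting 276065 seconds of work, reporting 0 completed tasks",
    "Wed 30 Sep 2009 23:49:03 SAST|rosetta@home|Scheduler request failed: HTTP internal server error",
    "Wed 30 Sep 2009 23:54:41 SAST|rosetta@home|Sending scheduler request: Requested by user. Requesting 276563 seconds of work, reporting 0 completed tasks",
    "Wed 30 Sep 2009 23:55:47 SAST|rosetta@home|Scheduler request failed: HTTP internal server error",
]

if __name__ == "__main__":
    print(dict(summarize(SAMPLE)))
```

On these four lines it counts two requests and two failures, which matches what the log shows: every request in that window came back with an HTTP internal server error.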

Cheers
Gray
ID: 63532 · Rating: 0
Gray Handcock
Joined: 26 Sep 05
Posts: 20
Credit: 2,018,415
RAC: 0
Message 63538 - Posted: 1 Oct 2009, 8:13:24 UTC - in response to Message 63532.  

Hmm, well whatever it was, I do seem to have got some WUs in the course of the early morning (here):

Thu 01 Oct 2009 01:30:41 SAST|rosetta@home|Scheduler request succeeded: got 5 new tasks

Still curious to know what the issue was, but at least I can continue crunching :)

cheers
Gray
ID: 63538 · Rating: 0
Sid Celery
Joined: 11 Feb 08
Posts: 2130
Credit: 41,424,155
RAC: 16,102
Message 63650 - Posted: 12 Oct 2009, 14:34:02 UTC - in response to Message 63538.  

An intermittent supply of new WUs over the last 3.5 hours. Any reason for it?

There's supposed to be a fair few available according to the Server Status page (19,000).
ID: 63650 · Rating: 0
P . P . L .
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 63672 - Posted: 13 Oct 2009, 20:51:14 UTC
Last modified: 13 Oct 2009, 21:04:07 UTC

There seem to be a number of users having the same problem.

Wed 14 Oct 2009 07:41:18 EST|rosetta@home|Sending scheduler request: To fetch work. Requesting 13964 seconds of work, reporting 0 completed tasks
Wed 14 Oct 2009 07:41:30 EST|rosetta@home|Scheduler request succeeded: got 0 new tasks

Something stuck?

EDIT// This new thread : https://boinc.bakerlab.org/rosetta/forum_thread.php?id=5102
ID: 63672 · Rating: 0
BarryAZ
Joined: 27 Dec 05
Posts: 153
Credit: 30,843,285
RAC: 0
Message 63673 - Posted: 13 Oct 2009, 21:04:40 UTC - in response to Message 63672.  

Seems that way -- I noticed this last night as well.

ID: 63673 · Rating: 0
P . P . L .
Joined: 20 Aug 06
Posts: 581
Credit: 4,865,274
RAC: 0
Message 63693 - Posted: 15 Oct 2009, 1:42:41 UTC
Last modified: 15 Oct 2009, 1:47:42 UTC

Do you have a problem again? I'm getting this when returning results and trying to get new ones.

Thu 15 Oct 2009 12:32:27 EST|rosetta@home|Scheduler request succeeded: got 0 new tasks
Thu 15 Oct 2009 12:32:27 EST|rosetta@home|Message from server: Server error: can't attach shared memory

I see the feeder is down. Is that the cause?

EDIT// A new app, I see. That explains a lot!
ID: 63693 · Rating: 0
Sid Celery
Joined: 11 Feb 08
Posts: 2130
Credit: 41,424,155
RAC: 16,102
Message 64146 - Posted: 23 Nov 2009, 16:59:50 UTC

On the home page:
Server Status as of 23 Nov 2009 16:19:35 UTC
[ Scheduler running ]
Total queued jobs: 0


Everything is running according to the Server Status page, but only 219 are ready to send.

Houston, we have a problem...
ID: 64146 · Rating: 0
Mod.Sense
Volunteer moderator
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 64155 - Posted: 23 Nov 2009, 19:19:29 UTC

Looks like 20,000 ready to send now. Number of outstanding WUs increasing as well. So work is flowing.
Rosetta Moderator: Mod.Sense
ID: 64155 · Rating: 0
AMD_is_logical
Joined: 20 Dec 05
Posts: 299
Credit: 31,460,681
RAC: 0
Message 64408 - Posted: 8 Dec 2009, 14:13:44 UTC

The home page currently says there are zero queued jobs.
ID: 64408 · Rating: 0
Sid Celery
Joined: 11 Feb 08
Posts: 2130
Credit: 41,424,155
RAC: 16,102
Message 64409 - Posted: 8 Dec 2009, 14:52:40 UTC

Yeah, I think the last WU I got was 6 hours ago. Still a good buffer here though.
ID: 64409 · Rating: 0



©2024 University of Washington
https://www.bakerlab.org