ECJ-INTEREST-L Archives

September 2013

ECJ-INTEREST-L@LISTSERV.GMU.EDU

Options: Use Monospaced Font
Show Text Part by Default
Condense Mail Headers

Message: [<< First] [< Prev] [Next >] [Last >>]
Topic: [<< First] [< Prev] [Next >] [Last >>]
Author: [<< First] [< Prev] [Next >] [Last >>]

Print Reply
Mime-Version:
1.0 (Mac OS X Mail 6.5 \(1508\))
Sender:
ECJ Evolutionary Computation Toolkit <[log in to unmask]>
Subject:
From:
Ralf Buschermöhle <[log in to unmask]>
Date:
Sat, 7 Sep 2013 17:31:33 +0200
In-Reply-To:
Content-Type:
multipart/signed; boundary="Apple-Mail=_78FCFF2B-1AA7-4C30-9888-8C3F0A114ADE"; protocol="application/pgp-signature"; micalg=pgp-sha1
Reply-To:
ECJ Evolutionary Computation Toolkit <[log in to unmask]>
Parts/Attachments:
text/plain (1193 bytes) , noSlavesConnected.stack (2 MB) , text/plain (2 MB) , 512SlavesConnected.stack (2 MB) , text/plain (2 MB) , signature.asc (2 MB)
Hi,

On Sep 6, 2013, at 23:48, Sean Luke <[log in to unmask]> wrote:

> So in order to handle this with the threadpool meant adding some features to the threadpool that ECJ already has, which OF COURSE meant rewriting it from scratch and refactoring the various classes which use it.  :-)  I *think* the new version works but (ahem) who knows.
> 
> Here are some new classes you can try.  

:)

> Tell me if things work a bit better.
>  The general idea was to push the thread back in the thread pool when it was finished rather than killing it; that way you'd not be recreating zillions of threads which is causing the Java VM memory leak you're seeing.
> 
> <ThreadPool.java><SlaveConnection.java><SlaveMonitor.java><Slave.java><SimpleEvaluator.java>

I just tried the code - but since I am not working on the current SVN branch and I changed some things in the code for my analysis I roughly adapted some things.

Nevertheless I noticed

a.) Connects and Disconnects are fast
b.) the threadpool tends to get larger (and larger) ... and the threads are not reused? ... I attached two thread dumps 
b.1.) after a few hundred connects and disconnects with (no slave connected then dumping the threads)




  b.2.) then after 512 slaves connected - I did not expect more threads but the stack is larger
Greetings, Ralf > On Sep 6, 2013, at 10:06 AM, Sean Luke wrote: > >> I think a better approach would be to use the thread pool rather than keeping the threads static like this. But it'll take a while to whip it up. >> >> Sean >> >> On Sep 6, 2013, at 9:43 AM, Ralf Buschermöhle wrote: >> >>> Hi, >>> >>> A little ... more elegant & less performance consuming solution was to change writeLoop() and return 'false' when waitOnMonitor returns one (receiving an interrupt): >>> >>> boolean writeLoop() >>> { >>> Job job = null; >>> >>> try >>> { >>> synchronized(jobs) >>> { >>> // check for an unsent job >>> if ((job = oldestUnsentJob()) == null) // automatically marks as sent >>> { >>> // failed -- wait and drop out of the loop and come in again >>> debug("" + Thread.currentThread().getName() + "Waiting for a job to send" ); >>> if (!slaveMonitor.waitOnMonitor(jobs)) // changed >>> return false; // new >>> } >>> ... >>> >>> Avoiding a busy waiting ... writer.isInterrupted ... of the previous version. >>> >>> Ralf >>> >>> On Sep 6, 2013, at 14:41, Ralf Buschermöhle <[log in to unmask]> wrote: >>> >>>> Hi, >>>> >>>> I added "!writer.isInterrupted()" to stop the (write)loop and now the threads are stopped. >>>> >>>> In context "SlaveConnection.java" >>>> >>>> writer = new Thread() >>>> { >>>> public void run() >>>> { >>>> while( !writer.isInterrupted() ) >>>> writeLoop(); >>>> } >>>> }; >>>> >>>> Greetings, >>>> >>>> Ralf >>>> >>>> On Sep 6, 2013, at 11:59, Ralf Buschermöhle <[log in to unmask]> wrote: >>>> >>>>> Hi, >>>>> >>>>> unfortunately adding memory does not solve the problem. >>>>> >>>>> It does not seem to be heap related ... I jstacked after 56 slaves, after 500 adding and removing (also waited for the disconnect messages) slaves and after the next 500 I received the Exception. >>>>> >>>>> And the number of threads did not reduce after waiting. >>>>> >>>>> I attached the thread dumps. >>>>> >>>>> It's the slaveMonitor ... who does not receive an interrupt? >>>>> >>>>> public boolean waitOnMonitor(Object monitor) >>>>> { >>>>> try >>>>> { >>>>> monitor.wait(); >>>>> } >>>>> catch (InterruptedException e) { return false; } >>>>> return true; >>>>> } >>>>> >>>>> Greetings, >>>>> >>>>> Ralf >>>>> >>>>> >>>>> <56.stack><56+-500.stack><Exception.stack> >>>>> >>>>> >>>> >>> >

ATOM RSS1 RSS2