
[creduce-dev] parallel tuning

To: creduce-dev@flux.utah.edu
Subject: [creduce-dev] parallel tuning
From: John Regehr <regehr@cs.utah.edu>
Date: Tue, 17 Nov 2015 11:00:01 +0100

C-Reduce's strategy of querying the number of CPUs and running that many parallel reduction attempts is bad in some cases, such as on my MacBook, where it runs with concurrency 8 but 3 would be a better choice.

We did a bunch of benchmarking of this a few years ago but I'm afraid that the results are very specific to not only the platforms but also the interestingness tests. Some of those have very light cache footprints whereas others (for example those that invoke static analyzers) tend to blow out the shared cache.

My current idea is that first we need to detect real cores instead of hyperthreaded cores, which is sort of a pain but we can special-case Mac OS and Linux I guess. Then maybe something like:


- parallelism 2 on a dual core
- 3 on a 4-core
- 4 on a >4 core
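
To make the proposal concrete, here is a rough Python sketch of the two steps (physical-core detection, then the mapping above). It is not C-Reduce code, and all function names are made up for illustration. It assumes `sysctl -n hw.physicalcpu` on Mac OS and the "physical id" / "core id" fields of /proc/cpuinfo on Linux, and it falls back to the logical CPU count anywhere else. The values for 1-core and 3-core machines are my guesses, since the list above does not cover them.

```python
import os
import platform
import subprocess


def count_physical_cores_from_cpuinfo(cpuinfo_text):
    """Count distinct (physical id, core id) pairs in /proc/cpuinfo text."""
    cores = set()
    physical_id = core_id = None
    # The trailing "" makes sure the last stanza is flushed too.
    for line in cpuinfo_text.splitlines() + [""]:
        if not line.strip():
            # A blank line ends one logical-processor stanza.
            if physical_id is not None and core_id is not None:
                cores.add((physical_id, core_id))
            physical_id = core_id = None
            continue
        key, _, value = line.partition(":")
        key = key.strip()
        if key == "physical id":
            physical_id = value.strip()
        elif key == "core id":
            core_id = value.strip()
    return len(cores)


def detect_physical_cores():
    """Best-effort physical core count; falls back to the logical count."""
    logical = os.cpu_count() or 1
    try:
        system = platform.system()
        if system == "Darwin":
            out = subprocess.check_output(
                ["sysctl", "-n", "hw.physicalcpu"], text=True
            )
            n = int(out.strip())
            if n > 0:
                return n
        elif system == "Linux":
            with open("/proc/cpuinfo") as f:
                n = count_physical_cores_from_cpuinfo(f.read())
            if n > 0:
                return n
    except (OSError, ValueError, subprocess.SubprocessError):
        pass  # Detection failed; use the logical count below.
    return logical


def choose_parallelism(physical_cores):
    """Map physical cores to the proposed number of parallel attempts."""
    if physical_cores <= 1:
        return 1  # Assumption: not covered by the proposal.
    if physical_cores == 2:
        return 2
    if physical_cores <= 4:
        return 3  # Proposal says 3 on a 4-core; 3 cores is an assumption.
    return 4


if __name__ == "__main__":
    cores = detect_physical_cores()
    print(f"{cores} physical cores -> parallelism {choose_parallelism(cores)}")
```

Some /proc/cpuinfo variants (for example on some ARM systems) do not report "physical id" at all, in which case this sketch silently uses the logical count and may still overestimate.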

How does this match with your experience?

John

Follow-Ups:
- Re: [creduce-dev] parallel tuning
  - From: Markus Trippelsdorf <markus@trippelsdorf.de>
- Re: [creduce-dev] parallel tuning
  - From: Eric Eide <eeide@cs.utah.edu>
