Changes between Version 22 and Version 23 of CreditNew


Timestamp: Mar 8, 2010, 4:37:13 PM
Author: davea
Comment: (none)

Legend:

  ' '  unchanged line
  '-'  line removed in v23
  '+'  line added in v23
  '…'  elided unchanged lines
 For GPUs, it's given by a manufacturer-supplied formula.

-However, other factors affect application performance.
-For example, applications access memory,
-and the speed of a host's memory system is not reflected
-in its Whetstone score.
+Other factors,
+such as the speed of a host's memory system,
+affect application performance.
 So a given job might take the same amount of CPU time
 on a 1 GFLOPS host as on a 10 GFLOPS host.
…
 Notes:

- * The peaks FLOPS of a device is single or double precision,
-   whichever is higher.
+ * For our purposes, the peak FLOPS of a device
+   is single or double precision, whichever is higher.
    Differentiating between single and double would unnecessarily
    complicate things, and the distinction will disappear soon anyway.
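A minimal sketch of the rule in this note, taking whichever precision's peak is higher; the Device struct is hypothetical, not BOINC's actual device descriptor:

{{{
#include <algorithm>

// Hypothetical device record (illustrative fields only).
struct Device {
    double flops_single;   // peak single-precision FLOPS
    double flops_double;   // peak double-precision FLOPS
};

// Peak FLOPS of a device: single or double precision, whichever is higher.
double peak_flops(const Device& d) {
    return std::max(d.flops_single, d.flops_double);
}
}}}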
…

  * Limited project neutrality: different projects should grant
-   about the same amount of credit per CPU hour, averaged over hosts.
+   about the same amount of credit per host-hour, averaged over hosts.
    Projects with GPU apps should grant credit in proportion
    to the efficiency of the apps.
    (This means that projects with efficient GPU apps will
-   grant more credit on average.  That's OK).
+   grant more credit than projects with inefficient apps.  That's OK).

 == Peak FLOP Count (PFC) ==
…
 the granted credit per job is adjusted
 so that the average is the same for each version.
-The adjustment is always downwards:
-we maintain the average PFC^mean^(V) of PFC() for each app version V,
-find the minimum X.
+
+We maintain the average PFC^mean^(V) of PFC() for each app version V.
+We periodically compute PFC^mean^(CPU) and PFC^mean^(GPU),
+and let X be the min of these.
 An app version V's jobs are then scaled by the factor

  S(V) = (X/PFC^mean^(V))
-

 The result for a given job J
 is called "Version-Normalized Peak FLOP Count", or VNPFC(J):

- VNPFC(J) = PFC(J) * (X/PFC^mean^(V))
+ VNPFC(J) = S(V) * PFC(J)

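A sketch of the version normalization above, under assumed names (AppVersion and pfc_mean are illustrative, not BOINC's schema); it transcribes S(V) = X/PFC^mean^(V) and VNPFC(J) = S(V) * PFC(J) directly:

{{{
#include <algorithm>

// Hypothetical per-app-version record; PFC^mean^(V) is assumed to be
// maintained elsewhere.
struct AppVersion {
    double pfc_mean;   // running average of PFC() for this version
};

// X = min of the periodically computed CPU and GPU means;
// S(V) = X / PFC^mean^(V).
double scale_factor(const AppVersion& v,
                    double pfc_mean_cpu, double pfc_mean_gpu) {
    double x = std::min(pfc_mean_cpu, pfc_mean_gpu);
    return x / v.pfc_mean;
}

// VNPFC(J) = S(V) * PFC(J)
double vnpfc(double pfc_j, const AppVersion& v,
             double pfc_mean_cpu, double pfc_mean_gpu) {
    return scale_factor(v, pfc_mean_cpu, pfc_mean_gpu) * pfc_j;
}
}}}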
 Notes:
…
    so it's probably better not to.

-
 == Cross-project normalization ==

…
 then for each version V we let
 S(V) be the average scaling factor
-for that GPU type among projects that do have both CPU and GPU versions.
+for that plan class among projects that do have both CPU and GPU versions.
 This factor is obtained from a central BOINC server.
 V's jobs are then scaled by S(V) as above.
…
 Notes:

- * Projects will run a periodic script to update the scaling factors.
- * Rather than GPU type, we'll probably use plan class,
+ * We use plan class,
    since e.g. the average efficiency of CUDA 2.3 apps may be different
    than that of CUDA 2.1 apps.
  * Initially we'll obtain scaling factors from large projects
    that have both GPU and CPU apps (e.g., SETI@home).
-   Eventually we'll use an average (weighted by work done) over multiple projects
-   (see below).
+   Eventually we'll use an average (weighted by work done)
+   over multiple projects (see below).
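A sketch of the work-weighted cross-project average described in these notes; the ProjectSample record is hypothetical, since the actual exchange format isn't shown in this revision:

{{{
#include <vector>

// One contributing project's report for a given plan class.
struct ProjectSample {
    double scale_factor;   // S(V) for this plan class at that project
    double work_done;      // weight
};

// Work-weighted mean of the scaling factors reported by projects
// that have both CPU and GPU versions.
double cross_project_scale(const std::vector<ProjectSample>& samples) {
    double num = 0, den = 0;
    for (const ProjectSample& s : samples) {
        num += s.scale_factor * s.work_done;
        den += s.work_done;
    }
    return den > 0 ? num / den : 1;   // no data: leave jobs unscaled
}
}}}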
…
 == Host normalization ==

-Assuming that hosts are sent jobs for a given app uniformly,
-then, for that app,
-hosts should get the same average granted credit per job.
-To ensure this, for each application A we maintain the average VNPFC^mean^(A),
-and for each host H we maintain VNPFC^mean^(H, A).
+The second normalization is across hosts.
+Assume jobs for a given app are distributed uniformly among hosts.
+Then the average credit per job should be the same for all hosts.
+To ensure this, for each app version V and host H
+we maintain PFC^mean^(H, A).
 The '''claimed FLOPS''' for a given job J is then

- F = VNPFC(J) * (VNPFC^mean^(A)/VNPFC^mean^(H, A))
+ F = VNPFC(J) * (PFC^mean^(V)/PFC^mean^(H, A))

 and the claimed credit (in Cobblestones) is
…

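A sketch of the claimed-FLOPS step above, with illustrative names; the Cobblestone conversion itself is elided from this hunk, so it isn't reproduced here:

{{{
// F = VNPFC(J) * (PFC^mean^(V) / PFC^mean^(H, A)).
// pfc_mean_v is PFC^mean^(V); pfc_mean_hv is PFC^mean^(H, A).
double claimed_flops(double vnpfc_j, double pfc_mean_v, double pfc_mean_hv) {
    return vnpfc_j * (pfc_mean_v / pfc_mean_hv);
    // Conversion of F to Cobblestones omitted (formula not in this hunk).
}
}}}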
 There are some cases where hosts are not sent jobs uniformly:
+
  * job-size matching (smaller jobs sent to slower hosts)
  * GPUGrid.net's scheme for sending some (presumably larger)
…

 This can be done by dividing
-each sample in the computation of VNPFC^mean^ by WU.rsc_fpops_est
+each sample in the computation of PFC^mean^ by WU.rsc_fpops_est
 (in fact, there's no reason not to always do this).
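A sketch of this size normalization: each PFC sample is divided by the job's estimated size before averaging. The plain running mean here is purely illustrative, standing in for whatever accumulator is actually used:

{{{
// Minimal running-mean accumulator (illustrative only).
struct Mean {
    double sum = 0;
    long n = 0;
    void add(double x) { sum += x; n++; }
    double value() const { return n ? sum / n : 0; }
};

// Fold one job's PFC into PFC^mean^, normalized by the job's
// estimated size (WU.rsc_fpops_est), so hosts that are systematically
// sent larger or smaller jobs aren't skewed.
void add_pfc_sample(Mean& pfc_mean, double pfc_j, double wu_rsc_fpops_est) {
    pfc_mean.add(pfc_j / wu_rsc_fpops_est);
}
}}}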

…
    and increases the claimed credit of hosts that are more efficient
    than average.
- * VNPFC^mean^ is averaged over jobs, not hosts.
+ * PFC^mean^ is averaged over jobs, not hosts.

 == Computing averages ==
…
  * A given sample may be wildly off,
    and we can't let this mess up the average.
- * Averages should be weighted by job size.

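One plausible scheme meeting the requirement above (the page's actual update rule lies outside this excerpt): clamp wild samples relative to the current mean, then fold them in with a decaying weight:

{{{
#include <algorithm>

// Running mean that a single wild sample cannot blow up.
struct RunningMean {
    double mean = 0;
    long n = 0;

    void update(double sample) {
        if (n > 0) {
            // Cap samples at 10x the current mean (arbitrary illustrative cap).
            sample = std::min(sample, 10 * mean);
        }
        n++;
        double w = std::max(0.01, 1.0 / n);   // decaying weight, floored at 1%
        mean += w * (sample - mean);
    }
};
}}}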
 In addition, we may as well maintain the variance of the quantities,
…

 == Compatibility ==
+