Context Navigation

Changes between Version 2 and Version 3 of CreditNew

Timestamp:: Oct 30, 2009, 3:54:58 PM (14 years ago)
Author:: davea
Comment:: --

Legend:

: Unmodified
: Added
: Removed
: Modified

CreditNew

-                      v2
+                      v3
 = New credit system design =
 == Introduction ==
 We can estimate the peak FLOPS of a given processor.
+== Peak FLOPS and efficiency ==
+BOINC estimates the peak FLOPS of each processor.
 For CPUs, this is the Whetstone benchmark score.
 For GPUs, it's given by a manufacturer-supplied formula.
 …
 is the ratio of actual FLOPS to peak FLOPS.
 GPUs typically have a much higher (50-100X) peak speed than CPUs.
+GPUs typically have a much higher (50-100X) peak speed than GPUs.
 However, application efficiency is typically lower
 (very roughly, 10% for GPUs, 50% for CPUs).
+== Credit system goals ==
+Some possible goals in designing a credit system:
+ * Device neutrality: similar jobs should get similar credit
+   regardless of what processor or GPU they run on.
+ * Project neutrality: different projects should grant
+   about the same amount of credit per day for a given host.
+It's easy to show that both goals can't be satisfied simultaneously
+when there is more than one type of processing resource.
 == The first credit system ==
+In the first iteration of credit system, "claimed credit" was defined as
+In the first iteration of BOINC's credit system,
+"claimed credit" was defined as
 {{{
 C1 = H.whetstone * J.cpu_time
 }}}
 There were then various schemes for taking the
 average or min of the claimed credit of the
 replicas of a job, and using that as the "granted credit".
+average or min of the claimed credit of the replicas of a job,
+and using that as the "granted credit".
 We call this system "Peak-FLOPS-based" because
 …
 The problem with this system is that, for a given app version,
 efficiency can vary widely.
+efficiency can vary widely between hosts.
 In the above example,
+host B would claim 10X as much credit,
+and its owner would be upset when it was granted
+only a tenth of that.
+the 10 GFLOPS host would claim 10X as much credit,
+and its owner would be upset when it was granted only a tenth of that.
 Furthermore, the credits granted to a given host for a
 series of identical jobs could vary widely,
 depending on the host it was paired with by replication.
+So host neutrality was achieved,
+but in a way that seemed arbitrary and unfair to users.
+This seemed arbitrary and unfair to users.
 == The second credit system ==
+To address the problems with host neutrality,
+we switched to the philosophy that
+We then switched to the philosophy that
 credit should be proportional to number of FLOPs actually performed
 by the application.
 …
 SETI@home had an application that allowed counting of FLOPs,
 and they adopted this system.
 They added a scaling factor so that the average credit
 was about the same as in the first credit system.
+They added a scaling factor so that the average credit per job
+was the same as the first credit system.
 Not all projects could count FLOPs, however.
 …
  * It didn't address GPUs.
  * project that couldn't count FLOPs still had host neutrality problem
  * didn't address single replication
+ * Project that couldn't count FLOPs still had device neutrality problems.
+ * It didn't prevent credit cheating when single replication was used.
 …
    change code, settings, etc.
+ * Device neutrality: similar jobs should get similar credit
+   regardless of what processor or GPU they run on.
+ * Device neutrality
  * Limited project neutrality: different projects should grant
 …
 == Peak FLOP Count (PFC) ==
 This system uses to the Peak-FLOPS-based approach,
+This system goes back to the Peak-FLOPS-based approach,
 but addresses its problems in a new way.
 When a job is issued to a host, the scheduler specifies usage(J,D),
 J's usage of processing resource D:
 how many CPUs, and how many GPUs (possibly fractional).
+how many CPUs and how many GPUs (possibly fractional).
 If the job is finished in elapsed time T,
 …
    (e.g., a CPU job that does lots of disk I/O)
    PFC() won't reflect this.  That's OK.
+   The key thing is that BOINC reserved the device for the job,
+   whether or not the job used it efficiently.
  * usage(J,D) may not be accurate; e.g., a GPU job may take
    more or less CPU than the scheduler thinks it will.
 …
    For now, though, we'll just use the scheduler's estimate.
+The idea of the system is that granted credit for a job J
+is proportional to PFC(J),
+The idea of the system is that granted credit for a job J is proportional to PFC(J),
 but is normalized in the following ways:
 == Version normalization ==
+== Cross-version normalization ==
 …
 find the minimum X,
 then scale each app version's jobs by (X/PFC*(V)).
 The results is called NPFC(J).
+The result is called "Version-Normalized Peak FLOP Count", or VNPFC(J).
 Notes:
 …
    or new app versions are deployed.
 == Project normalization ==
+== Cross-project normalization ==
 If an application has both CPU and GPU versions,
 …
 The solution to this is: if an app has only GPU versions,
+then we scale its granted credit by a factor,
+obtained from a central BOINC server,
+which is based on the average scaling factor
+then we scale its granted credit by the average scaling factor
 for that GPU type among projects that
 do have both CPU and GPU versions.
+This factor is obtained from a central BOINC server.
 Notes:
 …
 For a given application, all hosts should get the same average granted credit per job.
 To ensure this, for each application A we maintain the average NPFC*(A),
 and for each host H we maintain NPFC*(H, A).
+To ensure this, for each application A we maintain the average VNPFC*(A),
+and for each host H we maintain VNPFC*(H, A).
 The "claimed credit" for a given job J is then
 {{{
 NPFC(J) * (NPFC*(A)/NPFC*(H, A))
 }}}
 Notes:
  * NPFC* is averaged over jobs, not hosts.
  * Both averages are recent averages, so that they respond to
    changes in job sizes and app versions characteristics.
+VNPFC(J) * (VNPFC*(A)/VNPFC*(H, A))
+}}}
+Notes:
+ * VNPFC* is averaged over jobs, not hosts.
+ * Both averages are exponential recent averages,
+   so that they respond to changes in job sizes and app versions characteristics.
  * This assumes that all hosts are sent the same distribution of jobs.
    There are two situations where this is not the case:
    a) job-size matching, and b) GPUGrid.net's scheme for sending
    some (presumably larger) jobs to GPUs with more processors.
    To deal with this, we'll weight the average by workunit.rsc_flops_est.
+   To deal with this, we can weight jobs by workunit.rsc_flops_est.
 == Replication and cheating ==
 …
 by claiming excessive credit
 (i.e., by falsifying benchmark scores or elapsed time).
 An exaggerated claim will increase NPFC*(H,A),
+An exaggerated claim will increase VNPFC*(H,A),
 causing subsequent claimed credit to be scaled down proportionately.
 This means that no special cheat-prevention scheme
 …
  * One-time cheats (like claiming 1e304) can be prevented by
    capping NPFC(J) at some multiple (say, 10) of NPFC*(A).
+   capping VNPFC(J) at some multiple (say, 10) of VNPFC*(A).
  * Cherry-picking: suppose an application has two types of jobs,
         which run for 1 second and 1 hour respectively.
         Clients can figure out which is which, e.g. by running a job for 2 seconds
         and seeing if it's exited.
         Suppose a client systematically refuses the 1 hour jobs
         (e.g., by reporting a crash or never reporting them).
         Its NPFC*(H, A) will quickly decrease,
         and soon it will be getting several thousand times more credit
         per actual work than other hosts!
         Countermeasure:
         whenever a job errors out, times out, or fails to validate,
         set the host's error rate back to the initial default,
         and set its NPFC*(H, A) to NPFC*(A) for all apps A.
         This puts the host to a state where several dozen of its
         subsequent jobs will be replicated.
+  which run for 1 second and 1 hour respectively.
+  Clients can figure out which is which, e.g. by running a job for 2 seconds
+  and seeing if it's exited.
+  Suppose a client systematically refuses the 1 hour jobs
+  (e.g., by reporting a crash or never reporting them).
+  Its VNPFC*(H, A) will quickly decrease,
+  and soon it will be getting several thousand times more credit
+  per actual work than other hosts!
+  Countermeasure:
+  whenever a job errors out, times out, or fails to validate,
+  set the host's error rate back to the initial default,
+  and set its VNPFC*(H, A) to VNPFC*(A) for all apps A.
+  This puts the host to a state where several dozen of its
+  subsequent jobs will be replicated.
 == Implementation ==
+Database changes:
+New table "host_app_version"
+{{{
+int host_id;
+int app_version_id;
+double avg_vnpfc;       // recent average
+int njobs;
+double total_vnpfc;
+}}}
+New fields in "app_version":
+{{{
+double avg_vnpfc;
+int njobs;
+double total_vnpfc;
+}}}
+New fields in "app":
+{{{
+double min_avg_vnpfc;           // min value of app_version.avg_vnpfc
+}}}