Nuclear Engineering and Technology 49 (2017) 1172e1180<br />
<br />
<br />
<br />
Contents lists available at ScienceDirect<br />
<br />
<br />
Nuclear Engineering and Technology<br />
journal homepage: www.elsevier.com/locate/net<br />
<br />
<br />
Original Article<br />
<br />
Analysis of inconsistent source sampling in monte carlo<br />
weight-window variance reduction methods<br />
David P. Griesheimer*, Virinder S. Sandhu<br />
Naval Nuclear Laboratory, P.O. Box 79, West Mifflin, PA 15122-0079, USA<br />
<br />
<br />
<br />
<br />
a r t i c l e i n f o a b s t r a c t<br />
<br />
Article history: The application of Monte Carlo (MC) to large-scale fixed-source problems has recently become possible<br />
Received 31 May 2017 with new hybrid methods that automate generation of parameters for variance reduction techniques.<br />
Accepted 27 July 2017 Two common variance reduction techniques, weight windows and source biasing, have been automated<br />
Available online 14 August 2017<br />
and popularized by the consistent adjoint-driven importance sampling (CADIS) method. This method<br />
uses the adjoint solution from an inexpensive deterministic calculation to define a consistent set of<br />
Keywords:<br />
weight windows and source particles for a subsequent MC calculation. One of the motivations for source<br />
Monte Carlo<br />
consistency is to avoid the splitting or rouletting of particles at birth, which requires computational<br />
Variance Reduction<br />
Importance Sampling<br />
resources. However, it is not always possible or desirable to implement such consistency, which results in<br />
CADIS inconsistent source biasing. This paper develops an original framework that mathematically expresses<br />
Weight Windows the coupling of the weight window and source biasing techniques, allowing the authors to explore the<br />
impact of inconsistent source sampling on the variance of MC results. A numerical experiment supports<br />
this new framework and suggests that certain classes of problems may be relatively insensitive to<br />
inconsistent source sampling schemes with moderate levels of splitting and rouletting.<br />
© 2017 Korean Nuclear Society, Published by Elsevier Korea LLC. This is an open access article under the<br />
CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).<br />
<br />
<br />
<br />
<br />
1. Introduction source particles produced with weights that lay outside of the<br />
weight window for the corresponding birth state of the particle.<br />
Many common Monte Carlo (MC) variance reduction techniques Source particles produced with an inconsistent birth weight are<br />
rely on weight windows to control the statistical weight of particles immediately subjected to weight adjustment (splitting or roulette),<br />
during transport in order to minimize the variance of flux or re- which is widely believed to decrease the overall effectiveness of the<br />
action rate tallies in a specified region of phase space (e.g., position, weight-window variance reduction scheme.<br />
energy, direction). For these techniques, it is well known that the In 1998, Wagner and Haghighat [2] introduced the consistent<br />
optimal particle weight at each phase location in a problem is given adjoint-driven importance sampling (CADIS) method for creating<br />
by the objective-driven adjoint flux for that location [1]. Particles adjoint-based sets of weight-window parameters based off of a<br />
with weight outside of a predefined window about the optimal deterministic estimate for the adjoint flux. In addition, Wagner and<br />
weight are subjected to rouletting or splitting (which adjust the Haghighat showed that the deterministic estimate of the adjoint<br />
weight in a fair manner) in order to maintain the weight within the flux can also be used to define a biased source definition that is<br />
weight window. This weight adjustment applies at particle events consistent with the weight-window parameters. Here, consistent<br />
where the weight changes (e.g., births, collisions) as well as when means that source particles from the biased source are born with a<br />
particles move between regions of phase space with different weight that lies at the center point of the weight window corre-<br />
weight-window parameters. sponding to the initial (birth) phase state of the particle. The<br />
In early implementations of weight-window variance reduction development of a method for simultaneously creating a consistent<br />
methods, inconsistencies between the radiation source distribution source along with the weight-window parameters was a significant<br />
and the weight window parameters for the simulation resulted in advancement and is a major advantage of the CADIS method. The<br />
same consistency is also found in the forward-weighted CADIS<br />
(FW-CADIS) method used to calculate global MC solutions [3].<br />
However, even with the advancement of the CADIS method, there<br />
* Corresponding author. are some situations where it can be difficult to ensure a completely<br />
E-mail address: david.griesheimer@unnpp.gov (D.P. Griesheimer).<br />
<br />
http://dx.doi.org/10.1016/j.net.2017.07.017<br />
1738-5733/© 2017 Korean Nuclear Society, Published by Elsevier Korea LLC. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/<br />
licenses/by-nc-nd/4.0/).<br />
D.P. Griesheimer, V.S. Sandhu / Nuclear Engineering and Technology 49 (2017) 1172e1180 1173<br />
<br />
<br />
consistent source distribution, and, therefore, the CADIS method ever been performed to our knowledge. Although it appears self-<br />
cannot be applied as intended. evident that frequently adjusting the initial weight of source par-<br />
For example, the biased source produced by CADIS is based on ticles is counterproductive, it seems reasonable that a small initial<br />
an estimate of the adjoint flux distribution produced from a weight adjustment for source particles may be acceptable for many<br />
deterministic solution method e typically the discrete ordinates applications. However, such a conclusion requires a thorough<br />
(SN) method. As a result, the adjoint flux and the resulting biased characterization of the effect of inconsistent source sampling based<br />
source distribution are discretized over space, energy, and direc- on the degree of inconsistency.<br />
tion. In order to reduce the amount of time required to generate In this paper, we develop a mathematical framework for quan-<br />
weight-window parameters, a relatively coarse discretization may tifying the impact of inconsistent source sampling on the variance<br />
be used to estimate the adjoint flux [3]. Although the discretized of tallied quantities in a MC simulation. The derived relationships<br />
biased source produced from CADIS is guaranteed to be consistent are supported with results from numerical experiments and pro-<br />
with the corresponding weight-window parameters, the CADIS vide a foundation for additional analyses tailored to a variety of<br />
source does not preserve higher-order information about the specific applications.<br />
source distribution, which causes discretization error within the<br />
sampled MC source. In practice, many MC codes that use CADIS 2. Expected variance by sample scheme<br />
simply assume that source particles are uniformly distributed<br />
within each discretized “bin” of the CADIS source. However, this In this section, we derive the expected variance in estimated<br />
assumption may still lead to a bias in the results from the MC response for several different source sampling schemes. Prior to<br />
transport simulation, especially for cases where there is detailed proceeding, it is useful to define notation and significant statistical<br />
structure in the true source distribution, such as the energy spec- relationships that will be used throughout the remainder of the<br />
trum of a decay source. Any modification of the source distribution paper.<br />
to reduce this bias may lead to inconsistencies between the<br />
adjusted source and the original weight-window parameters. 2.1. Notation and basic relationships<br />
In other situations, the radiation source may be “presampled”<br />
from a preceding calculation and stored as a census file containing In MC transport methods, each history can be viewed as the<br />
detailed state information about the source particles. This scenario combination of two separate realizations: the initial (birth) state of<br />
is common when generating secondary radiations during MC the source particle, denoted x, and the response of the history as<br />
transport (e.g., (n,g) or (g,n) reactions), exchanging information measured against some predetermined objective, denoted r. In this<br />
between MC eigenvalue and fixed-source calculations, or in SN/MC context, we have assumed that the initial particle state, x, is a vector<br />
or MC/MC splice calculations where particles that reach a pre- that includes properties such as the birth energy, position, and<br />
defined “trapping surface” are stored for use in a subsequent MC direction of the particle, and that the response, r, is a scalar value.<br />
simulation [4]. In these cases, it is straightforward to collapse the Note that these are arbitrary assumptions and may be changed<br />
particle census into a discretized source representation for use in without loss of generality.<br />
CADIS. However, replacing the census source by the discretized To an external observer, ignorant of the inner workings of the<br />
representation would eliminate valuable information stored in the MC transport algorithm, it appears that each history produces a<br />
census, such as the correlations between the position, energy, and realization (x,r) from the joint probability density function (PDF)<br />
direction of each particle, and is not a practical solution for many p(x,r). Based on the properties of joint probability distributions, it<br />
splice calculations. Therefore, retaining the particle census in- follows that the expected value and variance of any function f(x,r)<br />
troduces inconsistent source sampling into the subsequent MC applied to a realization of the joint PDF is given by<br />
calculation.<br />
In addition, for some types of analyses, it is desirable to generate Z∞ Z<br />
a single set of weight-window parameters that can be used with a E½f ðx; rÞ ¼ f ðx; rÞpðx; rÞdxdr (1)<br />
range of similar model configurations, often representing source, ∞ G<br />
geometry, or composition perturbations with respect to a single<br />
reference scenario. In these cases, the CADIS method is well suited and<br />
for determining the weight-window parameters and a consistent h i<br />
source for the reference configuration, but it can become expensive Var½f ðx; rÞ ¼ E f 2 ðx; rÞ E½f ðx; rÞ2 ; (2)<br />
if the weight-window parameters and/or the consistent biased<br />
source must be regenerated for every model perturbation. In where G is the domain for the birth state of the particle.<br />
practice, a single set of weight-window parameters is often used for Note that the joint PDF p(x,r) can be written as the product of<br />
all of the model perturbations, regardless of whether each source conditional and marginal probability distributions, p(x,r) ¼ p(rjx) p(x).<br />
distribution is actually consistent with the weight-window In this case, the expected value of the function f(x,r) can be expressed<br />
parameters. as<br />
Finally, we note that, although the CADIS method has proven to Z<br />
be extremely successful, there are still weight-window variance<br />
E½f ðx; rÞ ¼ Ex ½Er ½f ðx; rÞjx ¼ Er ½f ðx; rÞjxpðxÞdx; (3)<br />
reduction techniques in use [5,6], and under development, that do<br />
not produce a consistent biased source distribution, for a variety of G<br />
reasons.<br />
where<br />
For any situation where an inconsistent source distribution may<br />
be used with weight-window variance reduction, it is important to Z∞<br />
have a clear understanding of the effects of weight adjustment via Er ½f ðx; rÞjx ¼ f ðx; rÞpðrjxÞdr; (4)<br />
splitting or rouletting immediately after particle birth. Although ∞<br />
the conventional wisdom maintains that any weight adjustment at<br />
birth will reduce the effectiveness of the weight-window variance and subscripts have been included on the expectation operators to<br />
reduction, no systematic, formal investigation of this conjecture has clarify which variable the expectation is taken with respect to. The<br />
1174 D.P. Griesheimer, V.S. Sandhu / Nuclear Engineering and Technology 49 (2017) 1172e1180<br />
<br />
" # " #<br />
relationship in Eq. (3) is commonly referred to as the fundamental<br />
1 XN<br />
1 XN<br />
Var½r<br />
property of conditional expected values, the law of total expecta- Var½m<br />
^ r ¼ Var ri ¼ 2 Var ri ¼ : (10)<br />
N i¼1 N N<br />
tion, or the law of iterated expectation. i¼1<br />
Similarly, the law of total variance can be used to express the Substituting Eq. (9) into Eq. (10) yields the final expression<br />
variance of f(x,r) in terms of the conditional probability,<br />
Ex ½Var½rjx Var½Er ½rjx <br />
Var½m<br />
^r ¼ þ : (11)<br />
Var½ f ðx; rÞ ¼ Ex ½Var½f ðx; rÞjx þ Var½Er ½f ðx; rÞjx; (5) N N<br />
<br />
where<br />
2.3. Importance sampling scheme<br />
<br />
2<br />
Var½ f ðx; rÞjx ¼ Er ðf ðx; rÞ Er ½f ðx; rÞjxÞ x : (6)<br />
The previous section established the basic estimators for the<br />
expected response and associated variance using only unbiased<br />
Note that Eq. (5) demonstrates that the total variance of f(x,r)<br />
realizations from the joint distribution p(x,r). However, it is well<br />
includes two components: the first term gives the variance due<br />
known that the variance of the expected response can be reduced<br />
only to the randomness of r for a fixed value of x, referred to as the<br />
via importance sampling [7]. In adjoint-driven importance sam-<br />
transport variance, and the second term gives the variance due to<br />
pling, the underlying source distribution (in this case, p(x)) is<br />
the effect of the randomness of x on the conditional distribution<br />
altered so that the initial particle state is drawn from a distribution<br />
p(rjx), which is referred to as the source variance. The use of the<br />
that is proportional to the corresponding response of the source. In<br />
conditional probability is often used for analysis of MC radiation<br />
order to preserve the original source distribution, the observed<br />
transport algorithms because it explicitly shows the dependence of<br />
response of the particles is weighted in proportion to the ratio of<br />
the response on the initial state of a particle.<br />
the probabilities of the initial state in the original and modified<br />
Now that the notation and relevant statistical relationships have<br />
probability distributions.<br />
been established, we will consider the variance of the estimated<br />
For the MC radiation transport process, let us define a biased<br />
response for several different source sampling schemes.<br />
source distribution p0 (x) such that<br />
<br />
2.2. Unbiased sampling scheme p0 ðx; rÞ ¼ pðrjxÞp0 ðxÞ: (12)<br />
<br />
Consider a fair MC transport process where each particle history The corresponding weighting factor for each state point is given by<br />
i begins with an initial particle state x ~ i sampled from the distri-<br />
bution p(x) and produces a corresponding response ~ri according to<br />
wðxÞ ¼ pðxÞ=p0 ðxÞ: (13)<br />
the probability distribution p(rjx). As described in the previous<br />
With importance sampling, the unbiased statistic for estimating<br />
section, the initial state and corresponding response can be viewed<br />
the expected response is<br />
as either sequential (conditional) realizations, or as a single reali-<br />
zation of the ordered pair (x ~ i ,~ri ) from the joint PDF p(x,r).<br />
1 XN 0<br />
For a simulation with N independent particle histories, an esti- b imp<br />
m r ¼ ~i ~r i ;<br />
w x (14)<br />
mate for the expected response can be computed with the unbiased N i¼1<br />
sample statistic<br />
~0i denotes a source state realization sampled from the<br />
where x<br />
1 XN<br />
modified distribution p0 (x0 ).<br />
m<br />
br ¼ ~r : (7)<br />
N i¼1 i Similarly, the sample variance statistic<br />
<br />
N 2<br />
The variance of the response can be estimated using the sample 1 X<br />
b 2<br />
s r;imp ¼ ~0i ~r i m<br />
w x b imp<br />
r ; (15)<br />
variance statistic N1<br />
i¼1<br />
<br />
<br />
1 X N is an unbiased estimator for Var[w(x0 )r], where<br />
2<br />
b<br />
sr ¼ ð~r m<br />
b r Þ2 ; (8)<br />
N 1 i¼1 i Var½wðx0 Þr ¼ Ex0 ½Var½wðx0 Þrjx0 þ Var½Er ½wðx0 Þrjx0 : (16)<br />
<br />
which is an unbiased estimator for the variance of the response r, Recognizing that the weight factor w(x0 ) is a constant with<br />
respect to the variance of r j x0 , the first term on the right-hand side<br />
Var½r ¼ Ex ½Var½rjx þ Var½Er ½rjx (9) of Eq. (16) can be rewritten as<br />
h i<br />
by the law of total variance [Eq. (5)]. Ex0 ½Var½wðx0 Þrjx0 ¼ Ex0 w2 ðx0 ÞVar½rjx0 : (17)<br />
In this context, the first term in Eq. (9) has the physical inter-<br />
pretation as the variance due to the randomness of the transport Applying the definition for the expectation operator Ex0 to Eq.<br />
process (transport variance), whereas the second term is the vari- (17) yields<br />
ance due to the randomness of the initial birth state of each source Z<br />
particle (source variance). Ex0 ½Var½wðx0 Þrjx0 ¼ Var½rjx0 w2 ðx0 Þp0 ðx0 Þdx0 : (18)<br />
Finally, we note that the variance of the sum of uncorrelated<br />
G<br />
variables is equal to the sum of the variance of the individual<br />
random variables (a relationship referred to in some texts as the Using Eq. (13) to relate p(x0 ) ¼ w(x0 )p0 (x0 ) and switching the<br />
Bienayme formula), which allows the variance of the estimator for dummy variable of integration from x0 to x allows Eq. (18) to be<br />
the mean response m b r to be written as rewritten as<br />
D.P. Griesheimer, V.S. Sandhu / Nuclear Engineering and Technology 49 (2017) 1172e1180 1175<br />
<br />
<br />
Z Z <br />
<br />
Ex0 ½Var½wðx0 Þrjx0 ¼ wðxÞVar½rjxpðxÞdx; (19) Var½wðxÞr Var½rjx<br />
¼ pðxÞdx; (24)<br />
E½r Er ½rjx<br />
G G<br />
<br />
which can be written in expectation notation as which shows that the relative variance of the total response is equal<br />
to the average of the relative variance for all of the conditional<br />
responses taken with respect to the true source distribution p(x).<br />
Ex0 ½Var½wðx0 Þrjx0 ¼ Ex ½wðxÞVar½rjx: (20)<br />
Note that the use of importance sampling for the source state<br />
Substituting Eq. (18) into Eq. (16) gives an expression for the does not eliminate the variability in observed response owing to<br />
response variance under importance sampling. the transport process (transport variance). In order to achieve a true<br />
zero-variance process, it is necessary to also adjust the transport<br />
process so that each initial state has the same observed response,<br />
Var½wðx0 Þr ¼ Ex ½wðxÞVar½rjx þ Var½wðx0 ÞEr ½rjx0 : (21) not just the same expected response as shown above.<br />
Note that the first term on the right-hand side of Eq. (21) is In consistent-source variance reduction methods, the source<br />
expressed in terms of the expectation and variance with respect to weighting function is chosen based on Eq. (23), in order to mini-<br />
the original source distribution, p(x), rather than the modified mize (or eliminate) the source variance term. It follows that this<br />
distribution p0 (x). importance weighting function does not necessarily minimize the<br />
formula for the variance of the total variance of the response, as demonstrated by the example<br />
Finally, applying the Bienayme<br />
problem described in the results section. For simplified problems, it<br />
sum of uncorrelated variables to Eq. (14) and using the result for<br />
is possible to show that a weighting function obtained by mini-<br />
Var[w(x′) r] gives the final variance for the statistic m<br />
br<br />
mizing both the transport and source variance terms in Eq. (21)<br />
produces a lower total variance. Generalized conditions for the<br />
h i E ½wðxÞVar½rjx Var½wðx0 Þ ½rjx0 optimum weighting function have not been derived. However, it<br />
x<br />
Var m^ imp<br />
r ¼ þ r<br />
: (22) should be noted that results from preliminary testing covering a<br />
N N<br />
range of possible model conditions suggest that the zero-source-<br />
Comparing Eqs. (22) and (11) illustrates the effect of importance<br />
variance weighting function given in Eq. (23) produces responses<br />
sampling via the presence of the weight parameter w(x) in each<br />
within a few percent of the optimal variance for realistic situations.<br />
term. It is particularly interesting to note that the weighting<br />
parameter affects both the transport variance and source variance<br />
2.4. Source splitting scheme<br />
terms. The effect on the transport variance term is somewhat sur-<br />
prising, as it suggests that importance sampling schemes that<br />
Next, let us consider a generalization of the expected response<br />
emphasize initial birth states with above-average response<br />
estimator in which source particles are split at birth and multiple<br />
(transport) variance may diminish the effectiveness of importance<br />
independent realizations are generated for each initial state point.<br />
sampling for reducing total variance.<br />
For an MC simulation with N0 independently sampled state<br />
The objective of the importance sampling scheme described in<br />
points and M independent realizations for each initial state point,<br />
this section is to reduce the total variance of the response by<br />
the unbiased importance sampling statistic for estimating the ex-<br />
altering the distribution of source states in an unbiased way. When<br />
pected response is<br />
importance sampling is applied to a process that involves a prob-<br />
ability distribution of only one variable, say p(x), it is known that N0 N0<br />
1 X 1 XM<br />
0 ~ 1 X ~zsplit ;<br />
the optimal weighting function for calculating the expected value b split<br />
m r ¼ wðx Þ r j;i ≡ (25)<br />
of any response function g(x) is given by w(x) ¼ E[g(x)]/g(x). In fact, N 0 i¼1 M j¼1 i N 0 i¼1 M;i<br />
this optimal weight gives Var[w(x)g(x)] ¼ 0 by ensuring that each<br />
sample gives a response that is exactly equal to the expected where ~rj;i is a realization from the conditional distribution p(rjx0 i),<br />
response E[g(x)]. split<br />
However, the identification of an optimal weighting function for and ~zM;i is a realization of the random variable obtained by taking<br />
importance sampling is not as straightforward when dealing with a the sample mean of the responses due to transporting M replicates<br />
joint probability distribution. By analogy with the single-variable ~0i . Note that the variable zsplit<br />
of initial source state x M is actually a<br />
case, it seems reasonable to assume that the weight should be function of a realization taken from the joint probability distribu-<br />
defined as tion p(x,r1, …, rM). Such a realization is not physically realistic,<br />
because it implies that a single source site will induce M response<br />
Z realizations. However, the joint realization is an accurate repre-<br />
Er ½rjxpðxÞdx sentation of MC particle transport with source particle splitting.<br />
G Furthermore, we note that the response variables r1, …, rM are<br />
wðxÞ ¼ ; (23) conditionally independent, which means that each variable is<br />
Er ½rjx<br />
conditionally dependent on the random variable x, but indepen-<br />
which is the ratio of the mean total response to the mean response dent from all of the other response variables. The property of<br />
conditioned on the initial state x. In fact, it can be immediately conditional independence allows the joint PDF for the birth state<br />
shown that the weighting function defined in Eq. (23) optimizes and M responses to be written as<br />
the variance associated with the selection of an initial source state Y<br />
M <br />
pðx; r1 ; …; rM Þ ¼ pðr1 ; …; rM jxÞpðxÞ ¼ pðxÞ p rj x : (26)<br />
by eliminating the source variance term (Var[Er[w(x′)rjx′] ¼ 0).<br />
j¼1<br />
Again, this is because the weighting function ensures that each<br />
source state contributes exactly the same expected response. Further recognizing that all of the response realizations are<br />
Applying the weight function in Eq. (23) to the equation for total taken from a common conditional PDF, p(rj j x) ¼ p(r j x) for all rj,<br />
variance of the response, Eq. (21), yields allows the joint PDF in Eq. (26) to be written as<br />
1176 D.P. Griesheimer, V.S. Sandhu / Nuclear Engineering and Technology 49 (2017) 1172e1180<br />
<br />
h i E ½wðxÞVar½rjx <br />
þ Var½wðx0 ÞEr ½rjx0 :<br />
split x<br />
pðx; r1 ; …; rM Þ ¼ pðxÞðpðrjxÞÞM : (27) Var zM ¼ (37)<br />
M<br />
Returning to the expected response estimator, m b split<br />
r , defined in Inspection of Eq. (37) indicates that source splitting only affects<br />
Eq. (25), it follows that the corresponding sample variance statistic the transport variance term, and that the total response variance<br />
per source sample will decrease as the splitting factor M increases.<br />
0 12 Equation (37) gives the variance for the mean response of M<br />
2 1 X N0<br />
@1<br />
X<br />
M 0 split A replicates for a single source state. The variance for the mean<br />
b<br />
s r;split ¼ w xi r j;i m<br />
~ br ; (28)<br />
N 0 1 i¼1 M j¼1 response over N0 independent trials, mb split , can be determined by<br />
r<br />
applying the Bienayme formula to Eq. (25), then substituting the<br />
is an unbiased estimator for Var [zsplit<br />
M ], where<br />
expression in Eq. (37), to yield<br />
<br />
h i h h ii h h ii h i E ½Var½wðxÞrjx Var½wðx0 ÞE ½rjx0 <br />
split split Var mb split ¼<br />
x<br />
þ<br />
r<br />
:<br />
¼ Ex0 Var zM x0 þ Var Er zM x0 :<br />
split<br />
Var zM (29) r<br />
MN 0 N0<br />
(38)<br />
<br />
Fortunately, Eq. (29) can be recast into a more meaningful form. A comparison of Eq. (38) with the variance for importance<br />
We begin by expanding the first term (transport variance) on the sampling without splitting [Eq. (22)] shows that the only difference<br />
right-hand side of Eq. (29) between the expressions is the presence of the 1/M factor in the<br />
first term of Eq. (38). Note that if the number of source samples are<br />
2 2 33<br />
h h ii X held constant between importance sampling with and without<br />
split 0 1 M <br />
Ex0 Var zM x ¼ Ex0 4Var4 wðx0 Þrj x0 5 5: (30) source splitting (e.g., N0 ¼ N), source splitting will cause the overall<br />
M j¼1 response variance to go down, because of a decrease in the trans-<br />
port variance term. This makes sense, as the source replicates are<br />
Recognizing that the number of replicates, M, and the weight reducing the statistical uncertainty in the response associated with<br />
factor w(x) are independent of the replicate number and response transporting radiation from each sampled initial particle state.<br />
variance allows Eq. (30) to be rewritten as However, if N0 and M are constrained such that the total amount<br />
2 2 33 of transport work is held constant (MN0 ¼ N), a question arises<br />
h h ii regarding optimal allocation of resources between M and N0 . In<br />
split 0 w 2 ðx 0 Þ XM 0<br />
Ex0 Var zM x ¼ Ex0 4 Var4 r 5 5:<br />
j x (31)<br />
M2 cases where the source variance is larger than the transport vari-<br />
j¼1 ance, the optimal allocation is to maximize the number of source<br />
Because the realizations of ri,j (i.e., rjjx) are conditionally inde- particles, N0 ¼ N, and use only one replicate for each source.<br />
formula to write However, as the magnitude of the source variance becomes small<br />
pendent, it is possible to apply the Bienayme<br />
relative to the transport variance, the penalty (i.e., increase in total<br />
h h i i E 0 w2 ðx0 ÞVar½rjx0 variance) associated with source splitting decreases. In the limiting<br />
0 x<br />
Ex0 Var zsplit<br />
M x ¼ : (32) case, when the source variance is equal to zero (i.e., all weighted<br />
M source sites produce the same expected response), the total<br />
Finally, we apply the procedure outlined in Eqs. (18e20) to write response variance is not affected by source splitting.<br />
Eq. (32) in terms of the expected value with respect to the unbiased This observation leads to several interesting conclusions. First,<br />
source distribution, p(x) this result confirms the long-held conventional wisdom that source<br />
splitting will increase the variance of tallied quantities in simula-<br />
h h i i E ½wðxÞVar½rjx <br />
0 x tions for which it is used, assuming that total work is held constant.<br />
Ex0 Var zsplit<br />
M x ¼ : (33)<br />
M However, the result also shows that the increase in variance due to<br />
splitting may be small in cases where the source variance is small<br />
Returning to the second term (source variance) in Eq. (29), we<br />
relative to the transport variance. This second result is especially<br />
can again expand this term and factor out constants to give<br />
intriguing when recalling that importance sampling is used to<br />
2 2 33 minimize the source variance term. Thus, in this regime, source<br />
h h ii 0Þ XM <br />
split 0 wðx splitting may be used to correct for an inconsistent source distri-<br />
Var Er zM x ¼ Var4 Er 4 rj x x0 5 5:<br />
0<br />
(34)<br />
M bution without a significant increase in total response variance.<br />
j¼1<br />
2.5. Source rouletting scheme<br />
Applying the definition of the conditional expected value for the<br />
response, Eq. (4), gives Let us now consider an expected response estimator in which<br />
2 3 source particles are subjected to Russian roulette at birth. For an MC<br />
M Z<br />
∞<br />
h h ii 0Þ X simulation with N0 independently sampled state points, the unbi-<br />
wðx<br />
Var Er zM x0 ¼ Var4 rj p rj x0 drj 5:<br />
split<br />
(35) ased importance sampling statistic for estimating the expected<br />
M j¼1<br />
0 response with Russian roulette is<br />
Because the random variables rj are conditionally independent and N0 ~<br />
0<br />
drawn from the same probability distribution p(r j x), it follows that b roulette 1 X ~ i ~r i<br />
bi w x 1 XN0<br />
~zroulette ;<br />
m r ¼ 0<br />
≡ (39)<br />
N i¼1 M N i¼1 M;i<br />
0<br />
2 3<br />
h h ii Z∞<br />
0 4wðx0 Þx0<br />
Var Er zsplit<br />
M x ¼ Var rpðrjx0 Þdr 5 ~0i are realizations from the joint distribution p0 (x0 ,r),<br />
where ~ri and x<br />
(36)<br />
0 ~<br />
and b is a realization from a Bernoulli distribution with probability<br />
i<br />
¼ Var½wðx0 ÞEr ½rjx0 : roulette<br />
of success given by M (for M 1), and ~zM;i is a realization of the<br />
Substituting Eqs. (33) and (36) into Eq. (29), gives the final random variable obtained by setting the response equal to zero<br />
split<br />
simplified expression for the variance of zM with probability (1 e M). Note that the random variable b<br />
D.P. Griesheimer, V.S. Sandhu / Nuclear Engineering and Technology 49 (2017) 1172e1180 1177<br />
<br />
<br />
determines whether each particle will survive the roulette process, common response estimator that can be used for both source<br />
and is completely independent from the random variables x and r. rouletting and splitting, including noninteger splitting ratios.<br />
In this case, the sample variance statistic For an MC simulation with N0 independently sampled state<br />
!2 points, the generalized sampling statistic for estimating the ex-<br />
1 X N0 ~ wðx<br />
b ~ i 0 Þ~r i<br />
2 pected response is<br />
b<br />
s r;roulette ¼ 0<br />
i<br />
m roulette<br />
b r ; (40)<br />
N 1 i¼1 M 0 0<br />
1 XN<br />
1 X<br />
QMS<br />
~0 wx 1 XN<br />
~zgeneral ;<br />
is an unbiased estimator for Var[zroulette<br />
M ], where b general<br />
m r ¼ 0 b j<br />
~ 0i ~r j;i ≡ 0 M;i (46)<br />
N i¼1 M N i¼1<br />
" <br />
2 # 2 j¼1<br />
h i bwðx0 Þr bwðx0 Þr<br />
roulette<br />
Var zM ¼E E ; (41)<br />
M M where ~rj;i is a realization from the conditional distribution p(rjx0 i),<br />
~ 0 is a realization from a Bernoulli distribution with probability of<br />
bj<br />
which can be factored to yield general<br />
success given by M/QMS, ~zM;i is a realization of the random var-<br />
h i 1 h i h i <br />
iable obtained by taking the sample mean of the responses due to<br />
¼ 2 E b2 E ðwðx0 ÞrÞ2 E½b2 E½wðx0 Þr2 ;<br />
roulette<br />
Var zM<br />
M transporting QMS replicates of the initial source site x ~ 0 , and<br />
(42) the symbol QS denotes a ceiling operation such that QMS is the<br />
smallest integer that is larger than M. As described in Section 2.4,<br />
because the realizations of (x0 ,r) are independent from the re- general<br />
the variable zM is actually a function of a realization taken from<br />
alizations of b. Recognizing that E[b2] ¼ M and E[b]2 ¼ M2 for a<br />
the conditionally independent joint probability distribution<br />
Bernoulli distributed random variable enables Eq. (42) to be<br />
p(x,r ,…,r ). In addition, the Bernoulli realizations b~ 0 are assumed<br />
simplified, giving 1 M i<br />
to be unconditionally independent from the random variables x<br />
h i 1 1M and r.<br />
¼ Var½wðx0 Þr þ<br />
roulette<br />
Var zM E½r2 : (43)<br />
M M Note that in Eq. (46) the parameter M determines how the<br />
weight adjustment is performed, with M < 1 causing source<br />
Again, the variance for the mean response over N0 independent<br />
roulette<br />
rouletting, M > 1 causing source splitting, and M ¼ 1 reverting to<br />
trials, m<br />
br , can be determined by applying the Bienayme formula<br />
standard importance sampling. It is also important to recognize<br />
to Eq. (39), then substituting the expression in Eq. (43) to yield that the estimator defined in Eq. (46) automatically accounts for<br />
h i 1 1M<br />
noninteger splitting ratios by adjusting the effective splitting ratio<br />
Var mb roulette<br />
r ¼ Var½wðx0 Þr þ E½r2 : (44) upward to the integer value QMS and then applying rouletting to<br />
MN 0 MN0<br />
eliminate the excess particles produced during splitting.<br />
For comparison with the corresponding variance for importance Based on the form of Eq. (46), it follows that the corresponding<br />
sampling with [Eq. (38)] and without splitting [Eq. (38)], the pre- sample variance statistic<br />
vious result for Var[w(x0 )r] given in Eq. (21) can be applied to Eq. 0 12<br />
0<br />
(44), yielding an alternate form 2 1 X N<br />
@ 1 X<br />
QMS<br />
0 0<br />
~w x general A<br />
b<br />
s r;general ¼ 0 b j<br />
~ i ~r j;i m<br />
br (47)<br />
h i E ½wðxÞVar½rjx N 1 i¼1 M j¼1<br />
Var mb roulette ¼<br />
x<br />
h i<br />
r<br />
MN0 general<br />
(45) is an unbiased estimator for Var zM , where<br />
0 0<br />
Var½wðx ÞEr ½rjx 1 M<br />
þ þ E½r2 : h i h h ii h h ii<br />
MN 0 MN 0 general general 0 general 0<br />
Var zM ¼ Ex0 Var zM x þ Var Er zM x :<br />
From Eq. (44), it is clear that applying Russian roulette will al-<br />
ways cause the response variance to increase relative to importance (48)<br />
sampling for a fixed number of source samples (N0 ¼ N). This in- Again, Eq. (48) can be simplified into a more intuitive form,<br />
crease is attributable both to a decrease in the denominator of each following the same general approach as applied in Section 2.4.<br />
term because M < 1, as well as the presence of an extra additive As usual, we begin by expanding the first term on the right-hand<br />
term proportional to the square of the expected response. side of Eq. (48) and moving constant factors out of the variance<br />
If N0 and M are constrained such that the total amount of operator, which gives<br />
transport work is held constant (M N0 ¼ N), Eq. (44) shows that<br />
2 2 33<br />
rouletting will always produce an increase in response variance h h ii <br />
general 0 w2 ðx0 Þ X<br />
QMS 0<br />
relative to a traditional importance sampling scheme [Eq. (22)]. Ex0 Var zM x ¼ Ex0 4 Var4 b0 5 5:<br />
2 j x<br />
rj (49)<br />
This variance penalty appears as an additive term, which is pro- M j¼1 <br />
portional to (1 e M)$E[r]2. Unlike the splitting scheme considered<br />
previously, which was dependent on the biased source distribution Recognizing that the realizations from b0 are independent and<br />
used for importance sampling, the variance penalty for rouletting the rj variables are conditionally independent allows application of<br />
only depends on the roulette survival probability and the expected the Bienayme formula to Eq. (49), resulting in<br />
response. Interestingly, Eq. (44) indicates that there is no increase 2 0 <br />
h h ii w ðx ÞQMS<br />
in variance when rouletting particles in a system where the ex- general 0 0 0<br />
Ex0 Var zM x ¼ Ex0 Var½b rjx : (50)<br />
pected response is zero. M2<br />
<br />
2.6. Generalized weight adjustment scheme By the definition of variance and the independence of