• No results found

On minimax allocation of stratified random sampling when only the order of stratum variances is known

N/A
N/A
Protected

Academic year: 2023

Share "On minimax allocation of stratified random sampling when only the order of stratum variances is known"

Copied!
7
0
0

Loading.... (view fulltext now)

Full text

(1)

Statistics & Decisions 12, 195-201 (1994)

ON M IN IM A X ALLO C A TIO N OF STRATIFIED R A N D O M SAM PLIN G W H E N ONLY TH E O R D E R OF S T R A T U M V A R IA N C E S IS K N O W N 1

M anoranjan Pal and Pulakesh Maiti

R eceived: R evised version: Decem ber 21, 1993 A b s tr a c t

This paper proposes the minimax criteria for obtaining the sample sizes to different strata when only the ranks o f the stratum variances, apart from the stratum sizes, are known, and obtains a very simple and elegant solution to this problem.

1 Introduction

In many p ractical situations in sam ple survey it may not be possible to know the exact values o f the stratum variances or it may even be very difficult to get good estimates o f the stratum variances, whereas the order o f the stratum variances may easily b e fou n d ou t from other sources. T o be specific, suppose we have strata with known sizes N i, N2, . . . , N s and unknown variances <Tj, a j , . . . , cr*. T he problem is to m inim ize V { y st) with respect to n i, n2, . . . , n s, the respective sam ple sizes, where

( i - i ) y r t ^ W h V h

1

with Wh = Nh/N, N — J2 Nh and y/, = for all h,yhi den otin g the value o f the i-th unit o f the sam ple from the /i-th stratum . y st is unbiased for the population mean under sim ple random sampling scheme. Expression for variance o f yst is (e.g., Cochran (1974))

vhu h nh

'T h e p ro b le m o f fin d in g an op tim al a llo c a tio n o f sam p le sizes to d ifferen t stra ta under a given o r d e r in g /s p a c in g o f stra tu m variances was in itia lly raised by P rofessor S. P. M u k h op a d h ya y o f University o f C a lc u t ta b efore the audience in the “ Sem inar on P ro b le m s o f L arg e Scale Sam ple Survey in India: 26 - 27 D ecem ber, 1990” - con d u c te d by C o m p u te r S c ie n c e U n it o f Indian Statistical In stitu te.

AM S (1 980) s u b je c t classification : 62 C 20

K eyw ords and phrases: Stratified ran d om sa m p lin g , m in im a x

(2)

Since the stratum variances are not known, m inim ization o f V is not possible. How­

ever, if it is possible to know the order o f the stratum variances, say,

( 1 .2 ) a 2 < o\ < . ■ . < cr2 ,

then one can h ope to minimize V with respect to n\, n 2, . . . , n s as well as a\, cr|,. . . , a]

su b ject t o th e conditions Y1 n h — n, > 0 for all h together with the condition (1-2). It is necessary to introduce further restriction such as — K to avoid trivial solutions like a\ — 0 for all h for the case where the condition (1 .2 ), say, is im posed. T h e value o f K , as we shall see later, does not affect the optim um values for the problem considered in this paper. Thus the value o f K need not be known apriori. In Pal and M aiti (1991), the solution for the sam e problem has been obtained.

A possible criticism o f the above approach stems from the fact that we have no con trol over the cr2 values. T he m inim ization problem described above will give som e op tim u m values of u^’ s. But, there is no guarantee that the optim u m values will b e equal to the actual values. In fact, it is perfectly possible that the optimum values b e c o m e far different from the actual values. A m ore reasonable approach to tackle this problem would be to get

(1.3) Min M ax V,

n a 2

where n and a 2 are the vectors o f and a? values respectively, su bject to the same conditions as described earlier. This procedure, thus, tackles the adverse situations so far as cr2 is concerned. One may also find

(1.4) M ax M in V

( j2 n

to see what w ill be the m axim um possible value over <J2 o f th e m inim um variance, since <r2 is not known.

In this paper we present the m inim ax solution for the p rob lem where the condi­

tion (1.2) is im posed. It so happens that the minimax and the m axim in solutions give the sam e optim um values for n (and cr2) and hence, also for the values o f the o b je ctiv e function.

T o sum m arize the above points, our ob je ct is to find s 1X/2 2

(1.5) Min M ax V — —

su b ject to

5

(1.6) 0 < cr,2 < a\ < . . . < a 2s and J 2 a l =

h = l S

(1.7) 7ih > 0 for = and n h =

h= 1

n.

(3)

Here Wh — N^/N, N = Yli=i Ni, N i , , N 3, n and K are given p ositiv e constants.

Even though the n /,’ s should be integers, the optim ization p roblem becom es to o dif­

ficult under this constraint. So we solve the problem allowing n^'s to b e nonnegative and approxim ate the optim al n^’s by integers hoping that it will give a near optim al solution.

2 Solution of the Optimization Problem

We start w ith a result which can be used to find the m axim um in (1 .5 ) for given n satisfying (1 .7 ).

L e m m a 2 .1 Let aj, a2, . . . , as be positive constants. Then, M ax„ 2 Ylh=i ah&\ subject to (1.6) is K m a x , a < s bh, where bh = — ^ T.)-h aj-

Proof: Define

Z h = (s - h + \){a2h - cr^_j) for h = 1 , 2 , . . . , s,

where we take a I = 0. Then it is easy to check that Ylh=i aha \ = £ iU i ^hZh and (1.6) is equivalent to

S

(2.1) Zh > 0 for h = l , 2 , . . . , s and Y ' Z h = I\.

h= 1

Let bj = m a x i < K , i(,. It is clear that m ax ^ fe/,/?/, subject to (2.1) is K b 3, which is

attained when Zj = K and Z^ = 0 for all h ^ j .

Theorem 2 .2 F or given n satisfying (1.7), M axCT2 Y.h=i W h a l / n h subject to (1-6) is

(2-2) ^ XS a<X>

where

1 s N 2

(2-3) f h = --- j— r E — •

s - h + 1 jr'h Uj

This T h eorem follows from Lem m a 2.1 on taking a/, = W^/nh = N%/(N 2n.h).

Thus, to find the minim ax solution o f (1.5) subject to (1.6) and (1 .7 ), we have to solve the follow in g problem :

(2.4) Min m ax f h ( n )

"■ 1 <h<s ~ subject to (1 .7 ), where f h ( n ) is given by (2.3).

Theorem 2 .3 The minimum in (2-4) is attained at some n satisfying (1.7).

(4)

P r o o f: If we take n^aNI (i.e., nh = n N l/ J 2j N j ) , then m a x f h ( n ) — Y lN f/ n = A (sa y ). H ence for the m inim ization in (2.4), all n ’s for which m a x //,(n ) is larger than A can b e ignored. Now for any fixed h, there exists 8^ > 0 such that if n/* < 5j, then m ax, f h { n ) > A whatever be the other n /s . Thus the p roblem (2.4) subject to (1 .7 ) is equivalent to (2.4) su bject to

S

(2 .5 ) rik > 8h for h = l , 2 , . . . , s and = /i=i

Since the set o f n's satisfying (2.5) is a com pact set in 1RS and m a x ,<h<s f h ( n ) is a continuous function on it, it follows that the m inim um is attained at some n satisfying (2 .5 ) and so satisfying (1.7).

C h oose and fix an optim al solution n° o f (2.4) su b ject to (1.7). W e introduce som e notations. Let

{ m u m 2, . . . , m k} = { i : f , ( n ° ) = m ax f h(n 0) } ,

~ h

where

1 < m i < m 2 < . . . < m k < s.

W e shall w rite rrik+i = s + 1 for convenience. Also let A r = { m r, m T + 1 , . . . , m T+i — 1}

and tr = \Ar \ = m T+\ — m r for r = 1, 2 , . . . , k. 0

L e m m a 2 .4 ni\ = 1.

P r o o f: Suppose not. Consider n* defined by ( n° — ke if i = 1

n i = S n° + £ if i € { m j , . . . , m k} .

[ n° otherwise

T hen for sufficiently small e > 0, it is easy to see that n* satisfies (1.7), fi(n * ) <

f i ( n° ) for i = 2 , 3 , . . . ,s and a contradiction to the optim ality of

L e m m a 2 .5 Let 1 < i < j < s. Then

N,

>

Nj_

Tl° ~ 11°

M o reo ver, i f i , j G A r f o r some r, then equality holds in (2.6).

( 2 ‘ 6) n° - n° '

(5)

Proof: S u ppose (2.6) does not hold. Let n* be defined as

Then for sufficiently small e > 0, n* satisfies (1.7) and

( 5 - h + 1 ) ( / a K ) - f h ( n ° ) ) = _____

W + i K

if /i < I

\i i < h < j ' if A > j 0

Thus //i(n * ) < f h ( n °) for h — 1,2, . . . , s , strict inequality h olding at least for h = 1,2, . .. , i. It follows that n* is also optim al for (2.4). So by Lem m a 2.4, maXftA(M*) = f i ( n * ) . Since fi (n * ) < = m a x (,/i(n 0), we have a contradic­

tion to the o p tim a lity o f n°. This proves the first statement. If i , j £ A r and strict inequality h olds in (2 .6 ), we arrive at a contradiction in a sim ilar way by taking e to be negative with sufficiently small absolute value. Here it should be noted that when i < h < j, // ,( « * ) < f h { n°) is not true but fh(n*) < f i { n ° ) holds since |e| is

sufficiently sm all. T his proves the lem m a. □

We now in trod u ce som e more notations. W e define

for any i , j w ith 1 < i < j < 5 and let

jV(r) = N ( m r, m T+i — 1)

for r = 1 , 2 ,. . . , k. T h e quantities X ( i , j ) and X ( r) are defined analogously.

Lemma 2.6

(2.7) N (1) < N ( i ) < . . . < N w and

Also let

(2.8) N ( m r, i) > 7V(rj if i £ A T.

Proof: Since fh (n ° ) = X ( h , s ) , w e have X ( m i , s ) = X ( m 2, s) = . . . = X ( m k, s ) . Hence

(2.9) X (1) = X {2) = . . . = X ( k) = X ( m T, m u - 1)

(6)

for any r and u such that 1 < r < u < k + l. Let the co m m on value of N{/n° for all i € A r b e den oted by cT. Then

^ ( r) = ~j~ ^ ~ T 1 2 = Cr N(r)-

t T i £ A r ' T t € A r

Since c t > c2 > . . . > ck, (2.7) follows from (2.9). T o prove (2 .8 ), let i € A r. Then X ( i + 1,5) < X ( m r, s ) gives X ( m r, i ) > X ( m r , s) = X ^ y Since XhCtNh within Ar,

(2.8) follow s. D

T h e o r e m 2 .7 (i) m\ = 1 and f o r any r with 1 < r < k, mr + 1 — 1 is the smallest i in the range m T < i < s at which

(2.10) min N ( m T, i )

mr<t<5 is attained.

(ii) I f i 6 A T, then

0 _ N { r ) N j

(2.11) w? = ( 1 x n.

'Pk t N 2

P r o o f: W e first show that the minimum in (2.10) is attained at i = mr + 1 — 1. By (2.7) and (2 .8 ), we have N ( m u, i ) > whenever i £ A u with u > r. So it easily follow s that N ( m r, i ) > = N ( m T, m r+] — 1) for i = m r, m T + 1 , . . . , s. We next show that mr + 1 — 1 is the smallest i at which the m inim u m in (2.10) is attained.

Suppose not. Let the minimum be attained also at j with m r < j < m T+\ — 1. Then N { m r, j ) - N(t ), so N ( j + l , m r+i - 1) = JV(r) and X ( j + l , m r+1 - 1) = X {ry Hence X ( j + 1,3) = X ( m r , s ) , a contradiction, since j + 1 does not belong to { m i , m 2, . . . , m k}. This proves (i).

To prove (ii), we first show that

t /V2

(2.12) $ > ? = ^ - 4 ^ - x

, t c ‘ E t i tuN2u) For this, we have

Ni trN [r) _ t rN 2T)

$ > ? = £ - = y ■

i € A r l € A r r T ( r )

Since X ( r) is independent o f r, (2.12) follows. Now since n°a7V,- within A r, we have

— > ri: = —t——— x n.

tr (r) ]£Ar £ u = l t ^ ( u )

From T h eorem 2.7 it follows that the optim al n° is unique and can be determined by the follow in g Procedure: First find m i = l , m2, . . . ,m ( using (i). T hen find n°, using (ii).

(7)

The op tim u m sam ple sizes have to be approxim ated roughly to the nearest in­

tegers (n l values, say), though the op tim u m sample sizes that w ould have been obtained if we restrict the values to natural numbers m a y n ot b e sam e as the n*h values. It should be pointed out that a m ore desirable criterion for optim um allocation w ou ld b e to im pose the set o f restrictions 2 < n /,(< N h) and n/, is an integer for all h. T his problem has not yet been solved.

A ck n ow led gem en ts. W e gratefully acknowledge Professors Bikas K . Sinha and A. R. R a o o f Indian Statistical Institute who went through an earlier draft o f this paper and gave many constructive suggestios for im provem ent and reduction o f this paper. B esides, Professor Sinha pointed out the superiority o f taking m inim ax criterion over the earlier (Pal and M aiti (1991)) criterion and P rofessor R ao pointed out the necessity o f proving the existence o f the solution. In fa ct, he supplied the proof of T h eo re m 2.3

References

[1] Cochran, G . W . (1974): Sampling Techniques. Second W iley Eastern Reprint.

John W ile y &; Sons, Inc.

[2] Pal, M . and P. Maiti (1991): O p tim u m Stratified R andom Sam pling Under a Given O rd erin g o f Stratum Varianced. Tech. Report No. E R U /6 /9 1

Manoranjan Pal and Pulakesh Maiti Economic Research Unit

Indian Statistical Institute 203 Barrackpore Trunk Road Calcutta 700 035, India

References

Related documents

For every data point, compute activations, pick (very few nodes) with high activations (using Bernoulli Sampling).. Need few nodes but Identifying which ones requires

Although the spread of the COVID-19 pandemic in the GCC region (United Arab Emirates, Bahrain, Saudi Arabia, Oman, Qatar and Kuwait) is still lower than most other parts of

Bank Statements of last 3 months 50% of Average of last 03 months bank balance would be considered as maximum premium paying capacity where Saving / Current (proprietorship)

Through the national level sample survey for estimation of marine fish landings based on the Stratified Multistage Random Sampling Design (SMRSD) information on individual

World liquids consumption for energy in the industrial sector, which was projected to increase by 1.1 percent per year from 2005 to 2030 in the IEO2008 reference case, increases by

Percentage of countries with DRR integrated in climate change adaptation frameworks, mechanisms and processes Disaster risk reduction is an integral objective of

This report provides some important advances in our understanding of how the concept of planetary boundaries can be operationalised in Europe by (1) demonstrating how European

The estimate provided here is the data product of CMFRI based stratified multistage random sampling design and may have to be reused with due citation credentials.. # The estimates