stats assn word 2

Upload: arpit-mehta

Post on 10-Apr-2018

218 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/8/2019 Stats Assn Word 2

    1/19

    TABLE OF CONTENT

    1.INTRODUCTION

    2.LIST OF VARIABLES USED

    3.DESCRIPTIVE TESTS

    a)Qualitative Variableb)Quantitative Variable

    4.HYPOTHESIS TESTINGS

    a)One Sample t-testb)Independent paired t-testc)One Way Anova

    5CONCLUSION

    6.APPENDIX

    1)QUESTIONNAIRE2)NOMENCLATURE3)LIST OF FIGURES AND TABLES4)RAW DATA

  • 8/8/2019 Stats Assn Word 2

    2/19

    INFORMATION MANAGEMENT (STATISTICS)

    ATM Usage by HSBC Customers

    INTRODUCTION

    An online study was conducted between 25th

    March to 30th

    March 2010, for Information Management Statisticsto understand and find out how HSBC clients understand and use Automated Teller Machines in their everydaylives for banking transactions.

    The survey seeks to investigate and derive comprehensive statistical information on the satisfaction levelsof HSBC ATM cardholders across various age groups, different genders and categories of accounts and theiropinions with respect to various aspects like cash withdrawal facility, no of atms present and other positive andinconvenient features of the atm.It further suggests steps to be initiated to improve the service quality of HSBC ATM and enhance the securitylevels to make this mode of virtual banking more reliable and user friendly.

    This study is one type of exploratory research using convenience sampling technique where samples have beenselected on the basis of convenience. Thus, projecting data beyond the sample is statistically inappropriate.

    The population considered here is HSBC customers based out of Juhu,Mumbai,India.In keeping with the standards of professional research a random sample size of 279 was selected for thepopulation size of 1005 .

    The list of variables used in the spss,discussions and tables are as below :

    VARIABLES SPSS VARIABLE

    NAMES

    CODINGS MEASURES

    age Age 1=less than 202=20-303=30-404=40-505=more than 50

    Ordinal

    gender Gender 1=Male2=Female

    Nominal

    Type of account Type_ac 1=Mass market2=power vantage3=premier account

    Nominal

  • 8/8/2019 Stats Assn Word 2

    3/19

    4=private bankingaccount

    Occupation occupation 1=professional2=Self Employed3=Student4=Unemployed

    Nominal

    Common problem Common_prob 1=Not enough cash inATMs

    2=Not enough ATMsaround3=Currency quality of notes not good4=Any other

    Nominal

    No of times cash withdrawnfrom the ATM

    no_cashwith Scale

    No of times cash deposited inATMs

    no_cashdep Scale

    No of times cheques depositedin ATMs

    no_chequed Scale

    No of times balance enquirydone no_balenenqu Scale

    Security enhancement ways security 1=Installing CCTVs2=Employing SecurityPersonnel3=Reducing cashwithdrwal limit4=Using biometricsystem5=any other

    Nominal

    Preferred banking channel channel 1=branch banking2=internet banking3=atm banking4=phone banking

    Nominal

    No of times HSBC ATMs used no_hsbcatm Scale

    No of times other banks ATMsused

    no_diffbank Scale

    Service Ratings rateservice 1=excellent2=very good3=good4=poor

    Ordinal

    Descriptive Tests

    1.Qualitative Variables

  • 8/8/2019 Stats Assn Word 2

    4/19

    Following descriptive measures are employed to analyze the qualitative variable rateservice

    a) Bar Chart

    b)Frequency Table

    c)Mode

    The frequency of answers to the service rating questions from the ATM usage data is presented in Table 1 andFigure 1.

    FIGURE 1. Frequency distribution of ratings on service quality of HSBC ATMs.

    Its apparent from the distribution that service quality is perceived to be very good by most of the respondents

    followed closely for the excellent rating .

  • 8/8/2019 Stats Assn Word 2

    5/19

    Frequency Percent

    ValidPercent

    CumulativePercent

    Valid 1 123 42.9 44.1 44.1

    2 124 43.2 44.4 88.5

    3 31 10.8 11.1 99.6

    4 1 .3 .4 100.0

    Total 279 97.2 100.0

    Missing System 8 2.8Total 287 100.0

    TABLE 1. Frequency distribution of ratings on service quality of HSBC ATMs.

    MODE

    Mode represents the most frequently occurring value.Thus looking at the table 2 we conclude that the most people perceived the service quality to be very good since mode value ie 2 represents very good .

    Statistics

    rateservice

    N Valid 279

    Missing 8

    Mode 2

    Table 2. Table representing the mode value.

    Quantitative Variable

    HISTOGRAM

    Following descriptive measures are employed to analyze the quantitative variables.

  • 8/8/2019 Stats Assn Word 2

    6/19

    CONTINOUS DATA VARIABLE

    HISTOGRAM Since the underlying data is continuous the Histogram is used for the variable age

    The following tables and figure represent the continuous variable age.

    Statistics

    age

    N Valid 275

    Missing 12

    Mode 3

    Skewness .213

    Std. Error of Skewness .147

    Kurtosis -.285

    Std. Error of Kurtosis .293

    Table3. Table representing Skewness and Kurtosis

    age

    Frequency Percent Valid Percent

    Cumulative

    Percent

    Valid 1 9 3.1 3.3 3.3

    2 79 27.5 28.7 32.0

    3 116 40.4 42.2 74.2

    4 58 20.2 21.1 95.3

    5 13 4.5 4.7 100.0

    Total 275 95.8 100.0

    Missing System 12 4.2

    Total 287 100.0

    Table 4. TABULAR REPRESENTAION OF VARIABLE AGE

  • 8/8/2019 Stats Assn Word 2

    7/19

    FIGURE 2. HISTOGRAM REPRESENTING THE VARIABLEage

    The following discusses different measures of Central Tendency used to represent the variable no_balenenqu

    1).MeanAt the confidence level of 95% the mean for the variable no_balenenqu is 3.04.

    2)Median

    The Median for the underlying variable is 3.

    Below are measures of dispersion used for the variable no_balenenqu

    1)Variance

    The variance for the vaiable is 2.837

    2)Standard Deviation

    The standard deviation for the variable is 1.684

  • 8/8/2019 Stats Assn Word 2

    8/19

    3)Range

    As can be seen from the table range for the variable is 10

    4)SkewnessSkewness for the variable is 1.471

    5)Similarly Kutosis for the variable is 3.012

    Case Pro cessing Summa ry

    Cases

    Valid Missing Total

    N Percent N Percent N Percent

    no_balenenqu 279 97.2% 8 2.8% 287 100.0%

    Table 5. Table representing the valid percent , missing values of the variable no_balenenq

  • 8/8/2019 Stats Assn Word 2

    9/19

    Desc r iptives

    Statistic Std. Error

    no_balenenqu Mean 3.04 .101

    95% Confidence Interval for

    Mean

    Lower Bound 2.84

    Upper Bound 3.24

    5% Trimmed Mean 2.89

    Median 3.00

    Variance 2.837

    Std. Deviation 1.684

    Minimum 0

    Maximum 10

    Range 10

    Interquartile Range 2

    Skewness 1.471 .146

    Kurtosis 3.012 .291

  • 8/8/2019 Stats Assn Word 2

    10/19

    Table 6.Table representing Mean,Median ,Variance,Standard Deviation,Skewness and Kurtosis

    Statistics

    no_balenenqu

    N Valid 279

    Missing 8

    Mode 2

    Skewness 1.471

    Std. Error of Skewness .146

    Kurtosis 3.012

    Std. Error of Kurtosis .291

    The following section discusses the hypothesis testings in comparisons involving a single mean, two independent

    means ie One Sample T-test and Independent- Sample T-tests and the procedure for carrying out multiple

    comparisons of means ie ANOVA.

    ONE SAMPLE T-TEST

    The following statistical procedure is used to perform the One sample T-test for the variable no_cashwith

    Stage 1) Setting up the null Hypothesis:

    The mean for the no of times cash is withdrawn from HSBC ATMs is 5

    Thus,

    H0 : = 5

    Stage 2)Setting up the alternative hypothesis:

  • 8/8/2019 Stats Assn Word 2

    11/19

    The mean for the no of times cash is withdrawn from HSBC ATMs is not 5.Thus for

    Two tailed test

    HA : 5

    Stage 3)Specifying the desired level of significance: =0.05

    Stage 4)Determining the appropriate statistical test:

    The t test will be employed

    One-Sample Statistics

    N Mean Std. Deviation Std. Error Mean

    no_cashwith 279 3.39 1.173 .070

    One-Sample Test

    Test Value = 5

    t df Sig. (2-tailed) Mean Difference

    95% Confidence Interval of the

    Difference

    Lower Upper

    no_cashwith -22.977 278 .000 -1.613 -1.75 -1.47

    Table7 : Testing of a single mean :output of a t-test

    5)Reaching a conclusion

    The two tailed p-value is 0.00 which is smaller than =0.05 and therefore we reject H 0.

    Hence we conclude that the mean for the no of times cash is withdrawn from HSBC ATMs is not 5.

  • 8/8/2019 Stats Assn Word 2

    12/19

    INDEPENDANT SAMPLE T-TEST

    Following is procedure to test means of two independant populations

    1) Setting up the null Hypothesis:The mean of no of times cash is deposited is the same for both males and females.Thus

    H0 : M = F

    2)Setting up the alternative hypothesis:

    HA : M F

    ie The mean of no of times cash is deposited is not the same for both males and females

    3)Specifying the desired level of significance:

    =0.05

    4)Determining the appropriate statistical test:The t test of independent population will be employed.However before that the test for equality of variances

    needs to be done.The Levenes test provides information about the variances of the two poputations and the

    related hypotheses,using an =0.05

    H0: The variances of the two genders are equal ie, 1 = 2.

    HA : The variances of the two genders are not equal ie, 1 2.

    So at this point the p-value is used to test the hypothesis

    The p-value ie 4.488 is larger than the chosen level of significance ie 0.05 .Therefore we retain the

    H0 ie the variances of the two genders are not significantly different.

  • 8/8/2019 Stats Assn Word 2

    13/19

    G ro up Statistics

    no_cashdep N Mean Std. Deviation Std. Error Mean

    gender 1 75 1.39 .490 .057

    2 123 1.46 .501 .045

    Independent Samples Test

    Levene's Test for Equality of

    Variances t-test for Equality

    F Sig. t df Sig. (2-tailed) Mean Difference

    gender Equal variances assumed 4.488 .035 -1.055 196 .293 -.077

    Equal variances not

    assumed

    -1.060 159.039 .291 -.077

    Table 8 :Tests of two independent populations : output of a t-test

    5)Reaching a conclusion

    The two tailed p-value is .035 which is greater than =0.05 and therefore we retain H 0.

    Thus we conclude that is there is no significant difference between the means of number of cash deposited bymales and females

    ANOVA

    Since we want to simultaneously test more than two means ,ANOVA is employed.

  • 8/8/2019 Stats Assn Word 2

    14/19

    Stage 1) Setting up the null Hypothesis:H0 : 1 = 2 = 3 = 4

    There is no significant difference in the mean usage of other banks atms between different occupations.

    Stage 2)Setting up the alternative hypothesis:

    HA : 1 2 3 4

    Not all means are equal or not atleast two means are equal.

    Stage 3)Specifying the desired level of significance:

    =0.05

    Stage 4)Determining the appropriate statistical test:

    Desc r iptives

    no_diffbank

    N Mean Std. Deviation Std. Error

    95% Confidence Interval for Mean

    Minimum MaximumLower Bound Upper Bound

    1 126 1.61 1.480 .132 1.35 1.87 0 6

    2 141 1.62 1.486 .125 1.37 1.86 0 6

    3 2 2.00 1.414 1.000 -10.71 14.71 1 3

    4 8 2.50 1.414 .500 1.32 3.68 0 4

    Total 277 1.64 1.481 .089 1.47 1.82 0 6

  • 8/8/2019 Stats Assn Word 2

    15/19

    Test of Ho m ogeneit y of V ar iances

    no_diffbank

    Levene Statistic df1 df2 Sig.

    .085 3 273 .968

    AN OVA

    no_diffbank

    Sum of Squares df Mean Square F Sig.

    Between Groups 6.354 3 2.118 .965 .410

    Within Groups 599.264 273 2.195

    Total 605.617 276

    Table 9.Test of four independent populations : output of Anova

    Since there are more than two means,Anova is employed.However before that the test for equality of variances

    needs to be done.The Levenes test provides information about the variances of the two poputations and the

    related hypotheses,using an =0.05

    H0: The variances of all the four occupations are equal ie, 1 = 2= 3 = 4

    HA : The variances of the two genders are not equal ie, 1 2 1 2.

    So at this point the p-value is used to test the hypothesis.

    The p-value ie .968 is larger than the chosen level of significance ie 0.05 .Therefore we conclude that the data

    do not violate the equality of variance assumption of ANOVA.

    5)Reaching a conclusion:

    We use the sig value in order to reach a conclusion.The sig value .410 is larger than =0.05,therefore we retain the

    null hypothesis that there are no significant differences between the mean usage of other banks of different

    occupations.

  • 8/8/2019 Stats Assn Word 2

    16/19

    CONCLUSIONS

    Based on the study above it can be concluded the perception of most of the customers towards the service quality

    of the bank machines is satisfactory.The attributes such as freqency of cash deposits and frequency of usage of

    other banks ATMs is independent of genders and occupations repectively. However the tests do show the mostrespondents have to be dependent on other banks atms for their routine banking transaction since there are not

    enough HSBC ATMs in Mumbai.Moreover most of them believe that by emplyoing security guards it would me

    more secure banking through the ATMs .As far as the questionnaire and survey is concerned we notice that

    standardized questions make measurement more precise by enforcing uniform definitions upon the

    participants.Moreover open ended questions help make subtle distinctions and analyze data better.

    AppendixThe following questionnaire was used for the study.

    1.What is your age?a) less than 20 b) 20-30 c) 30-40 d) 40-50 e)more than 50

    2.What is your gender?a)Male b)Female

    3.What type of account do you hold in HSBC ?a) Mass Market b) Power Vantage Account c) Premier Account d)Private Banking Account

    4.What is your occupation?a).Professional b) Self Employed c) Student d).Unemployed.

    5).What is the most common problem you face while using HSBC ATMs ?a).Not enough ATMs around.b.) Not enough cash in the machine c) Currency Notes' quality is not good d) Lesscash withdrawal facility e) Any other

    6)How many times do you withdraw cash every month?

    7)How many times do you deposit cash every month?

    8)How many times do you deposit cash every month?

    9)How many times do you do you do balance enquiry every month?

  • 8/8/2019 Stats Assn Word 2

    17/19

    10).What according to you will help the most in enhancing the security of ATM banking?1)Installing the CCTVs 2)Employing security personnals 3) Decreasing the cash withdrawal limit 4) UsingBiometric system 5)Any other

    11).Which channel do you prefer the most for banking ?a)Branch banking b) Internet Banking c) ATM banking d)Phone Banking e) Any other

    12).How many times a month do you use the HSBC Atms ?

    13).How many times a month do you use other banks Atms?

    14).How would you rate your HSBC ATMs usage of satisfaction as compared to other banks?a)Excellent b)Very good) Good c) Average d)Poor

    Nomenclature

    Mean

    Mode z

    Standard deviation

    Level of significance

    variance

    Null Hypothesis H0

    Alternative Hypothesis HA

    List of Figures and Tables used

    FIGURE 1. Frequency distribution of ratings on service quality of HSBC ATMs.

    TABLE 1. Frequency distribution of ratings on service quality of HSBC ATMs.

    Table 2. Table representing the mode value.

  • 8/8/2019 Stats Assn Word 2

    18/19

    Table3. Table representing Skewness and Kurtosis

    Table 4. Tabular representaion of variable age

    Figure 2. Histogram representing the variable age

    Table 5. Table representing the valid percent , missing values of the variable no_balenenq

    Table 6.Table representing Mean,Median ,Variance,Standard Deviation,Skewness and Kurtosis

    Table7 : Testing of a single mean :output of a t-test

    Table 8 :Tests of two independent populations : output of a t-test

    Table 9 Test of four independent populations : output of Anova

    RAW DATA SET USED FOR THE PURPOSE OF THE STUDY

    1.0 1.0 1.0 1.0 3.0 1.0 5.0 2.0 0.0 5.0 1.0 1.0 7.0 1.0 1.02.0 2.0 1.0 2.0 1.0 4.0 4.0 3.0 1.0 4.0 2.0 3.0 5.0 2.0 2.03.0 2.0 2.0 1.0 2.0 1.0 3.0 1.0 1.0 5.0 2.0 1.0 6.0 3.0 2.04.0 4.0 1.0 2.0 2.0 4.0 4.0 2.0 2.0 4.0 2.0 1.0 7.0 2.0 3.05.0 5.0 1.0 2.0 2.0 2.0 5.0 2.0 1.0 5.0 1.0 1.0 6.0 4.0 1.0

    6.0 2.0 1.0 2.0 4.0 4.0 7.0 1.0 0.0 4.0 3.0 4.0 8.0 3.0 1.07.0 3.0 1.0 2.0 2.0 1.0 2.0 1.0 0.0 5.0 1.0 1.0 4.0 5.0 1.08.0 1.0 1.0 2.0 4.0 4.0 1.0 1.0 4.0 1.0 1.0 5.0 6.0 2.09.0 4.0 2.0 2.0 2.0 3.0 5.0 2.0 2.0 5.0 2.0 1.0 5.0 4.0 2.010.0 2.0 2.0 3.0 2.0 1.0 2.0 3.0 1.0 4.0 2.0 3.0 3.0 3.0 3.011.0 2.0 2.0 3.0 2.0 4.0 4.0 2.0 4.0 10.0 4.0 3.0 5.0 4.0 1.012.0 3.0 2.0 3.0 1.0 1.0 3.0 2.0 5.0 1.0 2.0 3.0 4.0 3.0 3.013.0 3.0 2.0 2.0 1.0 2.0 7.0 3.0 9.0 6.0 2.0 3.0 7.0 6.0 2.014.0 5.0 2.0 2.0 4.0 3.0 5.0 2.0 15.0 8.0 2.0 3.0 15.0 4.0 2.015.0 3.0 2.0 2.0 1.0 4.0 6.0 2.0 1.0 4.0 1.0 3.0 6.0 5.0 2.016.0 3.0 2.0 2.0 2.0 4.0 4.0 2.0 1.0 3.0 3.0 3.0 5.0 3.0 1.017.0 3.0 2.0 1.0 2.0 2.0 3.0 2.0 0.0 8.0 2.0 2.0 8.0 5.0 1.018.0 1.0 1.0 2.0 1.0 6.0 2.0 0.0 9.0 1.0 3.0 10.0 2.0 1.019.0 4.0 1.0 2.0 4.0 1.0 3.0 1.0 0.0 3.0 3.0 3.0 5.0 2.0 1.020.0 3.0 2.0 2.0 4.0 1.0 2.0 1.0 0.0 5.0 2.0 3.0 5.0 1.0 3.021.0 4.0 1.0 4.0 2.0 4.0 4.0 1.0 1.0 6.0 1.0 1.0 6.0 2.0 2.022.0 1.0 1.0 1.0 1.0 4.0 6.0 3.0 1.0 7.0 4.0 3.0 8.0 2.0 1.023.0 3.0 1.0 2.0 2.0 4.0 1.0 1.0 1.0 8.0 1.0 3.0 8.0 2.0 1.024.0 3.0 1.0 2.0 4.0 4.0 4.0 1.0 1.0 3.0 2.0 4.0 4.0 3.0 1.025.0 3.0 1.0 3.0 1.0 4.0 4.0 4.0 1.0 5.0 1.0 2.0 5.0 2.0 1.026.0 4.0 1.0 2.0 2.0 1.0 5.0 1.0 1.0 6.0 1.0 2.0 7.0 3.0 2.0

  • 8/8/2019 Stats Assn Word 2

    19/19

    27.0 2.0 2.0 2.0 2.0 1.0 3.0 1.0 1.0 3.0 1.0 3.0 3.0 1.0 1.028.0 3.0 2.0 4.0 1.0 1.0 6.0 5.0 1.0 5.0 2.0 3.0 6.0 2.0 1.029.0 3.0 1.0 2.0 1.0 1.0 7.0 2.0 1.0 6.0 2.0 1.0 7.0 1.0 1.030.0 3.0 2.0 1.0 4.0 1.0 9.0 2.0 0.0 2.0 2.0 3.0 10.0 3.0 2.031.0 3.0 2.0 2.0 1.0 4.0 1.0 2.0 0.0 2.0 1.0 3.0 4.0 2.0 1.032.0 4.0 2.0 2.0 1.0 4.0 3.0 1.0 0.0 3.0 2.0 2.0 4.0 3.0 1.033.0 4.0 1.0 3.0 1.0 4.0 4.0 1.0 0.0 2.0 2.0 3.0 4.0 4.0 1.034.0 3.0 1.0 2.0 1.0 1.0 2.0 1.0 1.0 3.0 3.0 3.0 3.0 3.0 2.035.0 2.0 2.0 2.0 1.0 1.0 3.0 2.0 1.0 2.0 2.0 1.0 4.0 2.0 1.036.0 4.0 1.0 2.0 1.0 1.0 4.0 1.0 2.0 3.0 1.0 3.0 4.0 3.0 1.037.0 4.0 1.0 2.0 2.0 4.0 3.0 2.0 2.0 2.0 1.0 3.0 4.0 4.0 2.038.0 4.0 1.0 2.0 2.0 4.0 2.0 1.0 2.0 3.0 1.0 4.0 4.0 5.0 1.039.0 3.0 1.0 1.0 2.0 4.0 3.0 2.0 1.0 2.0 2.0 3.0 5.0 1.0 1.040.0 4.0 2.0 2.0 2.0 4.0 4.0 2.0 0.0 3.0 2.0 3.0 4.0 2.0 1.041.0 2.0 2.0 2.0 2.0 1.0 3.0 2.0 0.0 2.0 3.0 2.0 4.0 1.0 2.042.0 3.0 2.0 2.0 2.0 1.0 2.0 1.0 0.0 3.0 2.0 3.0 5.0 1.0 1.043.0 2.0 1.0 2.0 2.0 1.0 5.0 1.0 0.0 2.0 2.0 3.0 5.0 3.0 1.044.0 4.0 1.0 2.0 1.0 4.0 1.0 2.0 0.0 3.0 4.0 3.0 4.0 0.0 2.0

    45.0 2.0 1.0 2.0 1.0 4.0 2.0 2.0 1.0 2.0 4.0 1.0 5.0 0.0 2.046.0 4.0 1.0 2.0 1.0 4.0 3.0 3.0 1.0 3.0 5.0 3.0 5.0 0.0 1.047.0 4.0 1.0 2.0 1.0 4.0 4.0 3.0 1.0 2.0 2.0 3.0 4.0 3.0 1.048.0 4.0 2.0 2.0 2.0 1.0 5.0 2.0 0.0 3.0 2.0 3.0 5.0 4.0 2.049.0 3.0 2.0 2.0 1.0 1.0 4.0 2.0 0.0 2.0 1.0 1.0 5.0 5.0 2.050.0 4.0 1.0 2.0 1.0 1.0 3.0 1.0 0.0 3.0 1.0 1.0 4.0 4.0 2.051.0 4.0 1.0 2.0 1.0 1.0 2.0 1.0 2.0 2.0 1.0 1.0 5.0 3.0 3.052.0 3.0 1.0 2.0 1.0 4.0 3.0 2.0 2.0 3.0 2.0 1.0 5.0 4.0 2.053.0 4.0 1.0 1.0 1.0 4.0 4.0 2.0 1.0 2.0 2.0 1.0 4.0 2.0 2.054.0 4.0 1.0 1.0 1.0 4.0 3.0 3.0 0.0 3.0 2.0 3.0 4.0 1.0 1.0

    55.0 3.0 1.0 1.0 2.0 4.0 2.0 3.0 0.0 2.0 3.0 3.0 5.0 2.0 1.056.0 4.0 1.0 1.0 2.0 1.0 1.0 2.0 0.0 3.0 3.0 1.0 4.0 3.0 1.057.0 3.0 1.0 1.0 2.0 1.0 3.0 2.0 2.0 2.0 2.0 3.0 5.0 0.0 2.058.0 3.0 1.0 1.0 2.0 1.0 4.0 1.0 1.0 3.0 2.0 2.0 4.0 1.0 3.0