Representative sample size calculation. [closed]












3















I want to manually analyze bug reports of three large software projects. Total bug reports of the three projects are 10,000, 12,000, and 8000. I need to examine bug reports, comments, and bug fixing files. Manually analyze all bug reports are a time-consuming and difficult task. For these reasons, I would like to take a sample of bug reports from each project. Would you please suggest me how many bugs reports from each project should I analyze to make a representative sample size.










share|improve this question













closed as off-topic by paddy, Peter O., Pang, Werner Henze, EdChum Nov 21 '18 at 9:54



  • This question does not appear to be about programming within the scope defined in the help center.

If this question can be reworded to fit the rules in the help center, please edit the question.









  • 2





    I'm voting to close this question as off-topic because it is about statistics instead of programming / coding / programming tools / software algorithms.

    – Pang
    Nov 21 '18 at 9:16
















3















I want to manually analyze bug reports of three large software projects. Total bug reports of the three projects are 10,000, 12,000, and 8000. I need to examine bug reports, comments, and bug fixing files. Manually analyze all bug reports are a time-consuming and difficult task. For these reasons, I would like to take a sample of bug reports from each project. Would you please suggest me how many bugs reports from each project should I analyze to make a representative sample size.










share|improve this question













closed as off-topic by paddy, Peter O., Pang, Werner Henze, EdChum Nov 21 '18 at 9:54



  • This question does not appear to be about programming within the scope defined in the help center.

If this question can be reworded to fit the rules in the help center, please edit the question.









  • 2





    I'm voting to close this question as off-topic because it is about statistics instead of programming / coding / programming tools / software algorithms.

    – Pang
    Nov 21 '18 at 9:16














3












3








3


2






I want to manually analyze bug reports of three large software projects. Total bug reports of the three projects are 10,000, 12,000, and 8000. I need to examine bug reports, comments, and bug fixing files. Manually analyze all bug reports are a time-consuming and difficult task. For these reasons, I would like to take a sample of bug reports from each project. Would you please suggest me how many bugs reports from each project should I analyze to make a representative sample size.










share|improve this question














I want to manually analyze bug reports of three large software projects. Total bug reports of the three projects are 10,000, 12,000, and 8000. I need to examine bug reports, comments, and bug fixing files. Manually analyze all bug reports are a time-consuming and difficult task. For these reasons, I would like to take a sample of bug reports from each project. Would you please suggest me how many bugs reports from each project should I analyze to make a representative sample size.







random open-source






share|improve this question













share|improve this question











share|improve this question




share|improve this question










asked Nov 13 '18 at 0:32









Radia KarimRadia Karim

363




363




closed as off-topic by paddy, Peter O., Pang, Werner Henze, EdChum Nov 21 '18 at 9:54



  • This question does not appear to be about programming within the scope defined in the help center.

If this question can be reworded to fit the rules in the help center, please edit the question.




closed as off-topic by paddy, Peter O., Pang, Werner Henze, EdChum Nov 21 '18 at 9:54



  • This question does not appear to be about programming within the scope defined in the help center.

If this question can be reworded to fit the rules in the help center, please edit the question.








  • 2





    I'm voting to close this question as off-topic because it is about statistics instead of programming / coding / programming tools / software algorithms.

    – Pang
    Nov 21 '18 at 9:16














  • 2





    I'm voting to close this question as off-topic because it is about statistics instead of programming / coding / programming tools / software algorithms.

    – Pang
    Nov 21 '18 at 9:16








2




2





I'm voting to close this question as off-topic because it is about statistics instead of programming / coding / programming tools / software algorithms.

– Pang
Nov 21 '18 at 9:16





I'm voting to close this question as off-topic because it is about statistics instead of programming / coding / programming tools / software algorithms.

– Pang
Nov 21 '18 at 9:16












1 Answer
1






active

oldest

votes


















3














It depends on the following two things:



Confidence level: It tells you how sure you can be. The 95% confidence level means you can be 95% certain; the 99% confidence level means you can be 99% certain. Most researchers use the 95% confidence level.



Confidence interval (margin of error): It is the plus-or-minus figure that is an acceptable deviation from the actual result. Most researchers use the 5% confidence interval.



Therefore, you can use a 95% confidence level and 5% confidence interval to generate your sample size.



For example,



The population size of project A=10,000
Confidence Level = 95%
Confidence Interval =5%
So, representative sample size=370 (That means you should analyze 370 bug reports for project A)


I usually use the sample size calculator to calculate sample size.
(https://www.surveysystem.com/sscalc.htm#one)






share|improve this answer






























    1 Answer
    1






    active

    oldest

    votes








    1 Answer
    1






    active

    oldest

    votes









    active

    oldest

    votes






    active

    oldest

    votes









    3














    It depends on the following two things:



    Confidence level: It tells you how sure you can be. The 95% confidence level means you can be 95% certain; the 99% confidence level means you can be 99% certain. Most researchers use the 95% confidence level.



    Confidence interval (margin of error): It is the plus-or-minus figure that is an acceptable deviation from the actual result. Most researchers use the 5% confidence interval.



    Therefore, you can use a 95% confidence level and 5% confidence interval to generate your sample size.



    For example,



    The population size of project A=10,000
    Confidence Level = 95%
    Confidence Interval =5%
    So, representative sample size=370 (That means you should analyze 370 bug reports for project A)


    I usually use the sample size calculator to calculate sample size.
    (https://www.surveysystem.com/sscalc.htm#one)






    share|improve this answer




























      3














      It depends on the following two things:



      Confidence level: It tells you how sure you can be. The 95% confidence level means you can be 95% certain; the 99% confidence level means you can be 99% certain. Most researchers use the 95% confidence level.



      Confidence interval (margin of error): It is the plus-or-minus figure that is an acceptable deviation from the actual result. Most researchers use the 5% confidence interval.



      Therefore, you can use a 95% confidence level and 5% confidence interval to generate your sample size.



      For example,



      The population size of project A=10,000
      Confidence Level = 95%
      Confidence Interval =5%
      So, representative sample size=370 (That means you should analyze 370 bug reports for project A)


      I usually use the sample size calculator to calculate sample size.
      (https://www.surveysystem.com/sscalc.htm#one)






      share|improve this answer


























        3












        3








        3







        It depends on the following two things:



        Confidence level: It tells you how sure you can be. The 95% confidence level means you can be 95% certain; the 99% confidence level means you can be 99% certain. Most researchers use the 95% confidence level.



        Confidence interval (margin of error): It is the plus-or-minus figure that is an acceptable deviation from the actual result. Most researchers use the 5% confidence interval.



        Therefore, you can use a 95% confidence level and 5% confidence interval to generate your sample size.



        For example,



        The population size of project A=10,000
        Confidence Level = 95%
        Confidence Interval =5%
        So, representative sample size=370 (That means you should analyze 370 bug reports for project A)


        I usually use the sample size calculator to calculate sample size.
        (https://www.surveysystem.com/sscalc.htm#one)






        share|improve this answer













        It depends on the following two things:



        Confidence level: It tells you how sure you can be. The 95% confidence level means you can be 95% certain; the 99% confidence level means you can be 99% certain. Most researchers use the 95% confidence level.



        Confidence interval (margin of error): It is the plus-or-minus figure that is an acceptable deviation from the actual result. Most researchers use the 5% confidence interval.



        Therefore, you can use a 95% confidence level and 5% confidence interval to generate your sample size.



        For example,



        The population size of project A=10,000
        Confidence Level = 95%
        Confidence Interval =5%
        So, representative sample size=370 (That means you should analyze 370 bug reports for project A)


        I usually use the sample size calculator to calculate sample size.
        (https://www.surveysystem.com/sscalc.htm#one)







        share|improve this answer












        share|improve this answer



        share|improve this answer










        answered Nov 13 '18 at 1:18









        Rejaul KarimRejaul Karim

        225211




        225211















            Popular posts from this blog

            Florida Star v. B. J. F.

            Danny Elfman

            Retrieve a Users Dashboard in Tumblr with R and TumblR. Oauth Issues