Representative sample size calculation. [closed]
I want to manually analyze bug reports of three large software projects. Total bug reports of the three projects are 10,000, 12,000, and 8000. I need to examine bug reports, comments, and bug fixing files. Manually analyze all bug reports are a time-consuming and difficult task. For these reasons, I would like to take a sample of bug reports from each project. Would you please suggest me how many bugs reports from each project should I analyze to make a representative sample size.
random open-source
closed as off-topic by paddy, Peter O., Pang, Werner Henze, EdChum Nov 21 '18 at 9:54
- This question does not appear to be about programming within the scope defined in the help center.
If this question can be reworded to fit the rules in the help center, please edit the question.
add a comment |
I want to manually analyze bug reports of three large software projects. Total bug reports of the three projects are 10,000, 12,000, and 8000. I need to examine bug reports, comments, and bug fixing files. Manually analyze all bug reports are a time-consuming and difficult task. For these reasons, I would like to take a sample of bug reports from each project. Would you please suggest me how many bugs reports from each project should I analyze to make a representative sample size.
random open-source
closed as off-topic by paddy, Peter O., Pang, Werner Henze, EdChum Nov 21 '18 at 9:54
- This question does not appear to be about programming within the scope defined in the help center.
If this question can be reworded to fit the rules in the help center, please edit the question.
2
I'm voting to close this question as off-topic because it is about statistics instead of programming / coding / programming tools / software algorithms.
– Pang
Nov 21 '18 at 9:16
add a comment |
I want to manually analyze bug reports of three large software projects. Total bug reports of the three projects are 10,000, 12,000, and 8000. I need to examine bug reports, comments, and bug fixing files. Manually analyze all bug reports are a time-consuming and difficult task. For these reasons, I would like to take a sample of bug reports from each project. Would you please suggest me how many bugs reports from each project should I analyze to make a representative sample size.
random open-source
I want to manually analyze bug reports of three large software projects. Total bug reports of the three projects are 10,000, 12,000, and 8000. I need to examine bug reports, comments, and bug fixing files. Manually analyze all bug reports are a time-consuming and difficult task. For these reasons, I would like to take a sample of bug reports from each project. Would you please suggest me how many bugs reports from each project should I analyze to make a representative sample size.
random open-source
random open-source
asked Nov 13 '18 at 0:32
Radia KarimRadia Karim
363
363
closed as off-topic by paddy, Peter O., Pang, Werner Henze, EdChum Nov 21 '18 at 9:54
- This question does not appear to be about programming within the scope defined in the help center.
If this question can be reworded to fit the rules in the help center, please edit the question.
closed as off-topic by paddy, Peter O., Pang, Werner Henze, EdChum Nov 21 '18 at 9:54
- This question does not appear to be about programming within the scope defined in the help center.
If this question can be reworded to fit the rules in the help center, please edit the question.
2
I'm voting to close this question as off-topic because it is about statistics instead of programming / coding / programming tools / software algorithms.
– Pang
Nov 21 '18 at 9:16
add a comment |
2
I'm voting to close this question as off-topic because it is about statistics instead of programming / coding / programming tools / software algorithms.
– Pang
Nov 21 '18 at 9:16
2
2
I'm voting to close this question as off-topic because it is about statistics instead of programming / coding / programming tools / software algorithms.
– Pang
Nov 21 '18 at 9:16
I'm voting to close this question as off-topic because it is about statistics instead of programming / coding / programming tools / software algorithms.
– Pang
Nov 21 '18 at 9:16
add a comment |
1 Answer
1
active
oldest
votes
It depends on the following two things:
Confidence level: It tells you how sure you can be. The 95% confidence level means you can be 95% certain; the 99% confidence level means you can be 99% certain. Most researchers use the 95% confidence level.
Confidence interval (margin of error): It is the plus-or-minus figure that is an acceptable deviation from the actual result. Most researchers use the 5% confidence interval.
Therefore, you can use a 95% confidence level and 5% confidence interval to generate your sample size.
For example,
The population size of project A=10,000
Confidence Level = 95%
Confidence Interval =5%
So, representative sample size=370 (That means you should analyze 370 bug reports for project A)
I usually use the sample size calculator to calculate sample size.
(https://www.surveysystem.com/sscalc.htm#one)
add a comment |
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
It depends on the following two things:
Confidence level: It tells you how sure you can be. The 95% confidence level means you can be 95% certain; the 99% confidence level means you can be 99% certain. Most researchers use the 95% confidence level.
Confidence interval (margin of error): It is the plus-or-minus figure that is an acceptable deviation from the actual result. Most researchers use the 5% confidence interval.
Therefore, you can use a 95% confidence level and 5% confidence interval to generate your sample size.
For example,
The population size of project A=10,000
Confidence Level = 95%
Confidence Interval =5%
So, representative sample size=370 (That means you should analyze 370 bug reports for project A)
I usually use the sample size calculator to calculate sample size.
(https://www.surveysystem.com/sscalc.htm#one)
add a comment |
It depends on the following two things:
Confidence level: It tells you how sure you can be. The 95% confidence level means you can be 95% certain; the 99% confidence level means you can be 99% certain. Most researchers use the 95% confidence level.
Confidence interval (margin of error): It is the plus-or-minus figure that is an acceptable deviation from the actual result. Most researchers use the 5% confidence interval.
Therefore, you can use a 95% confidence level and 5% confidence interval to generate your sample size.
For example,
The population size of project A=10,000
Confidence Level = 95%
Confidence Interval =5%
So, representative sample size=370 (That means you should analyze 370 bug reports for project A)
I usually use the sample size calculator to calculate sample size.
(https://www.surveysystem.com/sscalc.htm#one)
add a comment |
It depends on the following two things:
Confidence level: It tells you how sure you can be. The 95% confidence level means you can be 95% certain; the 99% confidence level means you can be 99% certain. Most researchers use the 95% confidence level.
Confidence interval (margin of error): It is the plus-or-minus figure that is an acceptable deviation from the actual result. Most researchers use the 5% confidence interval.
Therefore, you can use a 95% confidence level and 5% confidence interval to generate your sample size.
For example,
The population size of project A=10,000
Confidence Level = 95%
Confidence Interval =5%
So, representative sample size=370 (That means you should analyze 370 bug reports for project A)
I usually use the sample size calculator to calculate sample size.
(https://www.surveysystem.com/sscalc.htm#one)
It depends on the following two things:
Confidence level: It tells you how sure you can be. The 95% confidence level means you can be 95% certain; the 99% confidence level means you can be 99% certain. Most researchers use the 95% confidence level.
Confidence interval (margin of error): It is the plus-or-minus figure that is an acceptable deviation from the actual result. Most researchers use the 5% confidence interval.
Therefore, you can use a 95% confidence level and 5% confidence interval to generate your sample size.
For example,
The population size of project A=10,000
Confidence Level = 95%
Confidence Interval =5%
So, representative sample size=370 (That means you should analyze 370 bug reports for project A)
I usually use the sample size calculator to calculate sample size.
(https://www.surveysystem.com/sscalc.htm#one)
answered Nov 13 '18 at 1:18
Rejaul KarimRejaul Karim
225211
225211
add a comment |
add a comment |
2
I'm voting to close this question as off-topic because it is about statistics instead of programming / coding / programming tools / software algorithms.
– Pang
Nov 21 '18 at 9:16