Representative sample size calculation. [closed]

I want to manually analyze the bug reports of three large software projects. The projects have 10,000, 12,000, and 8,000 bug reports, respectively. I need to examine the bug reports, their comments, and the bug-fixing files. Manually analyzing all of the bug reports is time-consuming and difficult, so I would like to take a sample of bug reports from each project. How many bug reports from each project should I analyze to get a representative sample?

asked Nov 13 '18 at 0:32

Radia Karim

363

closed as off-topic by paddy, Peter O., Pang, Werner Henze, EdChum Nov 21 '18 at 9:54

This question does not appear to be about programming within the scope defined in the help center.

If this question can be reworded to fit the rules in the help center, please edit the question.

2

I'm voting to close this question as off-topic because it is about statistics instead of programming / coding / programming tools / software algorithms.

– Pang
Nov 21 '18 at 9:16

random open-source

1 Answer
1

It depends on two things:

Confidence level: It tells you how reliable the sampling procedure is. At the 95% confidence level, about 95 out of 100 samples drawn the same way would give a result within the margin of error of the true value; at the 99% level, about 99 out of 100 would. Most researchers use the 95% confidence level.

Confidence interval (margin of error): It is the plus-or-minus figure you are willing to accept around the sample result. With a 5% margin, if 40% of the sampled bug reports have some property, the true share in the whole project is expected to lie between 35% and 45%. Most researchers use a 5% confidence interval.

Therefore, you can use a 95% confidence level and a 5% confidence interval to calculate your sample size.

For example, for project A:

Population size = 10,000

Confidence level = 95%

Confidence interval = 5%

So the representative sample size is 370. That means you should analyze 370 randomly selected bug reports from project A.
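The figure of 370 can also be reproduced by hand. The sketch below assumes the standard Cochran formula for a proportion with a finite population correction, and a worst-case proportion of p = 0.5; both are assumptions, since the answer does not say which formula is behind the number. The function name `sample_size` and the project labels B and C are made up for illustration.

```python
import math
from statistics import NormalDist


def sample_size(population: int, confidence: float = 0.95,
                margin: float = 0.05, p: float = 0.5) -> int:
    """Cochran's sample size with a finite population correction.

    p = 0.5 is the most conservative choice (it gives the largest sample);
    it is an assumption, not something stated in the answer.
    """
    # Two-sided z-score for the confidence level (about 1.96 for 95%).
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    # Sample size for an effectively infinite population.
    n0 = z ** 2 * p * (1 - p) / margin ** 2
    # Shrink it for a finite population, then round up to whole reports.
    return math.ceil(n0 / (1 + (n0 - 1) / population))


# Bug report totals from the question; labels B and C are hypothetical.
for name, total in [("A", 10_000), ("B", 12_000), ("C", 8_000)]:
    print(name, sample_size(total))
```

Because the result is rounded up, a calculator that rounds to the nearest integer may report one report fewer for some population sizes. Note how little the sample size changes between 8,000 and 12,000 reports: once the population is in the thousands, the margin of error and confidence level matter far more than the population size.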

I usually use this online sample size calculator:
(https://www.surveysystem.com/sscalc.htm#one)
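A sample of the right size is only representative if the bug reports are picked at random rather than, say, taking the most recent ones. A minimal sketch of drawing such a sample is shown here; the helper `draw_sample` and the ID range 1 to 10,000 are hypothetical stand-ins for the real IDs exported from a project's bug tracker.

```python
import random


def draw_sample(bug_ids, n, seed=0):
    """Return n distinct bug IDs chosen uniformly at random, sorted."""
    # A fixed seed makes the selection reproducible for later audits.
    rng = random.Random(seed)
    return sorted(rng.sample(list(bug_ids), n))


# Hypothetical IDs 1..10000 stand in for project A's real bug IDs.
project_a_ids = range(1, 10_001)
selected = draw_sample(project_a_ids, 370)
print(len(selected), selected[:5])
```

Recording the seed alongside the results lets someone else regenerate exactly the same list of bug reports.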

answered Nov 13 '18 at 1:18

Rejaul Karim

225211
