The research is carried out to support Goal 3 of Sustainable Development Goals (SDGs), which titled Good Health and Well-being, and it ultimately aims to ensure healthy lives and promote well-being for all at all ages.

MST 2034 Design of Experiments Assignment

The research is carried out to support Goal 3 of Sustainable Development Goals (SDGs), which titled Good Health and Well-being, and it ultimately aims to ensure healthy lives and promote well-being for all at all ages. The dataset is acquired through website kaggle.com which offers open datasets that are reliable. The dataset itself, titled ‘Do we still need gyms?’ has 12 different variables, and samples from 1000 people who was tested with three new medicines to improve physical features. For a simpler model, only medicine 1 and medicine 2 are taken into account for the purpose of this research, although it didn’t mention which kind of medicines were used.

This objective of this research topic is about investigating how would initial weights and medicines taken affect the weight difference of people whose ages range from 19 to 50.

The experimental units are the people who underwent the experiment given. There are no specific replications, as the number of each treatment combinations would not be the same throughout the whole experiment. The treatments are their initial weight (in kilograms), which takes the Greek signs (α, β, γ, and δ) and the second one being the medicines taken, which takes the Latin signs (A, B, C, and D). As stated, each of the treatment takes four levels in the design. The minimum and maximum of their initial weights are 51.6 and 109.2 kg respectively, which then being grouped into four groups range from 51.6 – 75 kg (α), 75.1 – 80 kg (β), 80.1 – 85 kg (γ), and 85.1 – 109.2 kg (δ). While some of the ranges may deviate unusually than other groups, as this is to help balance the frequencies of each category (the data may come close to a uniform distribution, or else it would be difficult to sample from balanced interval). Whereas the other treatment is distributed into only medicine 1 is taken (A), only medicine 2 is taken (B), both medicines are taken (C), and none of the medicines are taken (D) by the participants. Although medicine 3 is also considered in the dataset given, but it won’t be evaluated as it will complicate the current model. The response variable is the weight difference, which takes the variable ‘KgAfter’ deducted by another variable ‘KgBefore’. The response variable won’t take the absolute value as it is to allow more accuracies to be maintained within the model itself.

Randomization exists in the research as the participants are varied in gender, ages, initial weights, initial kilometres run before taking medicines, medicines taken as according to their current physical fitness, health, and diseases, etc. There is also no particular order on how the samples were taken. Replication is not possible as often times there is no more than one sample for each treatment combination. Although some of the treatment combination would have multiple samples in it, we would directly average out the value to make it a single value or replication. Whereas blocking is made to eliminate the variability transmitted from nuisance factors. Blocking is possible as there are multiple binary targets in the dataset given. The first blocking is variable ‘gender’ combined with ‘age’ which gives four levels of blocking factor, consists of ‘male ages from 19 – 34’, ‘female ages from 19 – 34’, ‘male ages from 35 – 50’, and ‘female ages from 35 – 50’. The second blocking is variable ‘side effects’ combined with ‘difference in kilometres run’ which takes the variable ‘KmAfter’ deducted by another variable ‘KmBefore’, which then gives four levels of blocking factor, consists of ‘negative difference in kilometres run without side effects’, ‘negative difference in kilometres run with side effects’, ‘positive difference in kilometres run without side effects’, and ‘positive difference in kilometres run with side effects’. As stated, each of the blocking also takes four levels in the design. To simplify, each level of blocking factor is transformed into group 1, 2, 3, and 4, as shown in the table given later.

Linear model for the experimental design:

y_ijkl= µ + α_i + β_j + γ_k + δ_l + ε_ijkl

where

i = 1, 2, 3, 4

j = 1, 2, 3, 4

k = 1, 2, 3, 4

l = 1, 2, 3, 4

µ = overall mean

α_i = effect of initial weights (treatment 1)

β_j= effect of medicines taken (treatment 2)

γ_k = effect of gender/ age (blocking 1)

δ_l = effect of side effect/ KmRunDifference (blocking 2)

ε_ijkl = random error ̴ NID (0, σ²)

Pairwise Treatment Effects

Greek Treatment (Initial Weights)

Latin Treatment (Medicines Taken)

q_0.05(4, 3) = 6.824526

T_0.05 = 6.824526 * (52.01)^1/2 = 49.2171

There is total of 68 samples taken across all treatment combinations, although in the final model only takes up one value for each treatment combination.

Any pairs of treatment averages that differ in absolute value by more than 24.6085 would imply that the corresponding pair of population means are significantly different. However, there is none of the means pairs exceeds the indicated value, meaning that there is no significant difference in initial weights and the medicines taken at any setting.

Standard error = 49.2171, hence the 95% confidence interval for all treatment means:

Greek Treatment (Initial Weights)

Latin Treatment (Medicines Taken)

Contrast

Orthogonal contrast is suggested between categories 1 and 2 as a group and categories 3 and 4 as a group for both treatments, assuming that the mean difference of weight differences of the first group is the same as the second group for both treatments.




	Latin Treatment			Greek Treatment
	A	38.575		α	92.0867
	B	45.7283		β	47.2167
	C	40.5667		γ	24.8625
	D	58.9625		δ	19.6667

	Source of variation	df	Sum of Squares	Mean Square	F₀
	Latin	3	63.19	21.06	1.7452
	Greek	3	816.42	272.14	22.5491
	Rows	3	75.74	25.25	2.0918
	Columns	3	42.15	14.05	1.1643
	Orthogonal Contrasts
	C₁	1	140.35	140.35	11.6288
	C₂	1	3.62	3.62	0.3001
	Error	1	12.07	12.07
	Total	15	1153.54
	C₁: ȳ_1… + ȳ_2… – ȳ_3… – ȳ_4…
	C₂: ȳ_.1.. + ȳ_.2.. – ȳ_.3.. – ȳ_.4..

	F_{0.05, 3, 1}	215.707
	F_{0.05, 1, 1}	161.448

We conclude from F statistic (F₀< F_0.05,1,1) that there are no significant differences in mean weight difference between levels 1 and 2 and levels 3 and 4 of the initial weights, and also between levels 1 and 2 and levels 3 and 4 of the medicines taken at 5% significance level (by default).





or by Scheffe’s method
C₁: ȳ_1… + ȳ_2… – ȳ_3… – ȳ_4…			=	23.6935
C₂: ȳ_.1.. + ȳ_.2.. – ȳ_.3.. – ȳ_.4..			=	-3.8065
s_Cu	14.4239
s _0.05,u	76.092
where u = 1,2

Because C₁ < s _0.05,1, we conclude that the contrast equals to zero, that is, we conclude that the mean weight differences of initial weights 1 and 2 as a group does not differ from the means of initial weights 3 and 4 as a group. Furthermore, because C₂ < s _0.05,2, we conclude that the contrast equals to zero, that is, we conclude that the mean weight differences of medicines taken 1 and 2 as a group does not differ from the means of medicines taken 3 and 4 as a group, at 5% significance level (by default).

F₀ of treatment factor initial weights equals to 5.2322 is the closest to the critical value of F_0.05,3,3 = 9.27663, hence its H₀ has the highest possibility to be rejected than any other treatment or blocking factors. Hence, it is recommended to eliminate the initial weights treatment factor, reduced into a Latin square design with the remaining one treatment and two blocking factors. Hence, we would like to resample from the 1000 samples given as the eliminated factor would result in more samples for each treatment combination.

Linear model for the experimental design:

y_ijk= µ + α_i + β_j + γ_k + ε_ijk

where i = 1, 2, 3, 4

j = 1, 2, 3, 4

k = 1, 2, 3, 4

µ = overall mean

α_i = effect of medicines taken (treatment)

β_j= effect of gender/ age (blocking 1)

γ_k = effect of side effect/ KmRunDifference (blocking 2)

ε_ijk = random error ̴ NID (0, σ²)

Compared to Graeco-Latin square design, there is a much difference in the sum of squares of the elements perhaps due to the resampling, results in the change in the value of each treatment combination. However, F₀ does not differ as much due to the increasing degrees of freedom for error term. Though the critical value F_0.05,3,6 = 4.75706 as the increase in degrees of freedom in error term, but all of the H₀ are not to be rejected, which still remains the same as the previous design.

Reduction from Graeco-Latin to Latin square design in this scenario does not affect the result significantly, though sometimes it allows more error when such reduction occurs in any other research. The increase in degrees of freedom in error term results in a lower criterion for the rejection of H₀, which generally is not the proper way of conducting research. But in this case, Latin square design might be more preferable than Graeco-Latin square design as the latter does not improve the design in any way.

To further improve the design of the experiment for this research topic, it is suggested that more variable should be considered such as their Body Mass Index (BMI) to replace the current treatment factor ‘initial weights’ as the BMI is a more preferable and accurate representation and classification of the human health.

Reference

Sara, L. (2021). Do We Still Need Gyms? kaggle. https://www.kaggle.com/datasets/saralattarulo/do-we-still-need-gyms

Continue to order Get a quote

Calculate the price of your order

Type of paper needed:

Pages:

550 words

Academic level:

We'll send you the first draft for approval by September 11, 2018 at 10:52 AM

Total price:

$26

The price is based on these factors:

Academic level

Number of pages

Urgency

Basic features

Free title page and bibliography
Unlimited revisions
Plagiarism-free guarantee
Money-back guarantee
24/7 support

On-demand options

Writer’s samples
Part-by-part delivery
Overnight delivery
Copies of used sources
Expert Proofreading

Paper format

275 words per page
12 pt Arial/Times New Roman
Double line spacing
Any citation style (APA, MLA, Chicago/Turabian, Harvard)

The research is carried out to support Goal 3 of Sustainable Development Goals (SDGs), which titled Good Health and Well-being, and it ultimately aims to ensure healthy lives and promote well-being for all at all ages.

Recent Posts

Recent Comments

Calculate the price of your order

Our guarantees

Money-back guarantee

Zero-plagiarism guarantee

Free-revision policy

Privacy policy

Fair-cooperation guarantee