The Effects of Real Life Background of Online Product Photos on Consumer Behavior.

Research Questions, Hypotheses, and Effects

Present backgrounds of product images are basically two kinds, real-world backgrounds, and white backgrounds. The null hypothesis will be that products with real-world background images do not influence consumer behavior. The alternative hypothesis will be that products with real-world background images influence consumer behavior. The sample will randomly and evenly be selected from students in different states in the U.S. The experiment will cooperate with a furniture shopping website and there will be two kinds of images with contrasting backgrounds for the same products. The purchasing rate and the rate of adding to the shopping cart will all be analyzed to prove one of the hypotheses. The data on the size and price of the furniture will also be collected to test whether there are other relationships. This study aims to answer the following questions:
Research Question 1: Does a product with a real world background image have a higher purchase rate than a product with a white background?
Description: We compare the purchase rate of products with different backgrounds of product photos. R_T represents the purchasing rate of the product in the treatment group which uses product photos with a real world background, and R_C represents the purchasing rate of the product in the control group which uses product photos with a white background. The suggested effect size is 10%.

Null Hypothesis (H0_a): R_T <= R_C

Alternative Hypothesis (H1_a): R_T > R_C

Research Question 2: Does a product with a real world background image have a higher add-to-cart rate than a product with a white background?
Description: We compare the add-to-cart rate of products with different backgrounds of product photos. R_T represents the add-to-cart rate of the product in the treatment group which uses product photos with a real world background, and R_C represents the add-to-cart rate of the product in the control group which uses product photos with a white background. The suggested effect size is 10%.

Null Hypothesis (H0_b): R_T <= R_C

Alternative Hypothesis (H1_b): R_T > R_C

Importance of the Study and Social Impact

E-commerce and online shopping platforms have been emerging and developing rapidly in recent years. The COVID-19 outbreak that started in 2020 also boosted the evolution of online retail and doubled online orders during the first wave of the pandemic (Szász et al., 2022). As the size of the online shopping market expands, the competition between businesses is also getting fiercer over time. Online retail sellers have invested a great amount of time and money into increasing their market share and sales by optimizing online marketing strategies. Among various marketing strategies, visual marketing has been regarded as a critical part of online marketing as consumers’ decisions rely greatly on the visual perception of products and the photos of an item can deliver much more detailed information and experiences to consumers compared to non-visual descriptions (Pieters & Wedel, 2012). However, little existing literature focuses on the effect of product photos with different backgrounds on consumers’ buying attitudes. Present studies are also restricted to a small fraction of various product categories. Among the limited amount of studies about this topic, the majority were conducted using clothes or other fashion items as target products. Real-life backgrounds such as streetscapes and indoor settings were found to have positive effects on increasing sales volume as well as upscaling the quality of the clothes (Xia et al., 2020).

With the rapid development of online retail, the types of products people purchase online are getting increasingly varied. Our study focuses on the online furniture market in the U.S. which has a market size valued at $27.74 billion in 2021 and predicted to reach over 40 billion dollars by 2030 (ReportLinker, 2022). However, there is little existing literature and studies about the impact of product images with different backgrounds on home furniture, which leaves a great potential for increasing sales by understanding the methodology behind product photography and improving visual marketing in the online furniture retail industry. Therefore, our research will help fill the gap in the present literature. It will also provide valuable insights for online furniture retailers to improve their visual marketing strategies and boost product sales. Additionally, well-designed product images will also improve the effectiveness of visual marketing, increase customer retention and bring long-term profits to sellers. The findings of our research can also build a foundation for future research which further investigates how images with different backgrounds or components convey different information. Future studies concerning modifying product photography for different groups of customers such as people with dyslexia and color blind can be conducted.

Literature Review

E-commerce is now a common format of retail trade and a quickly expanding industry as well. The competition on B2C online shopping websites is becoming increasingly fierce. A simple search for one certain product on an online shopping platform can generate thousands of products supplied by different sellers. A major advantage of online shopping is that it is convenient. However, a significant disadvantage is that the product cannot be touched or felt. Therefore, images play an important role in online shopping. Lots of present literature indicated that consumers’ buying decisions can be affected both directly or indirectly by the attention that consumers put on the product.Other than the price, visual aspects of the product can also affect a consumer’s purchase decision to a large extent (Gonchigjav, 2020).It was found that as many as 87.6% of respondents considered the product image to be central to their shopping experience (Ergonode, 2022). Researchers studying online consumer behavior believe that most digital content is consumed by scrolling - until an interesting graphic catches the eyes of the user, it is difficult to keep them interested for even a short period of time(Goswami, 2011).
Photos that perfectly illustrate functional characteristics of a product and are presented in an acceptable arrangement that suits a certain category or industry have a greater impact. Good product images boost sales and help to develop the overall brand image as professional, innovative, and detail-oriented, which is particularly essential in the case of furniture. Among different visual marketing strategies, designing product photos is regarded as a crucial strategy to improve the effectiveness of communicating product information, building the positive first image of the product, as well as shaping consumers’ buying decisions (Jeon & Yoh, 2014). Colors could be a variable that affects consumers precision. When product color and background color are presented together in the consumer’s vision, these colors will constitute the visual effect of the product–background color combination. (Huang ,2004) .If the product is shot in a real life background, there are studies that show relevance between the real world scenario and consumer attractiveness. In the experiment in the research (Kim et al., 2014) The results showed that a product presentation was significantly different in attractiveness, informativeness, satisfaction, and repurchase intention after controlling apparel items and model. This product presentation in everyday life had greater mean values than product presentation with the posing model. In other words, background could be an important variable to make the purchase.

Research Plan

Population of Interest

The authors of this research plan are mainly interested in how the backgrounds of selling products will influence the purchasing pattern. This study would focus on everyone in the United States who are willing to buy furniture online as the population of interest.
Sample Selection

The authors plan to randomly and evenly select college students from different states in the US using interval sampling method. The company automatically gives every other customer coming to buy products the experimental websites. The study would use IP tracking to help make sure that each user is consistently in either the control group or treatment group. The sample size is designed to have 3000+ participants. The operation procedure will be conducted through cooperating with an online furniture-selling website, ideally a start-up company. The reason for choosing start-up companies is because large companies might already have their own technical departments doing similar things to this study, and also, start-up companies would be willing to cooperate because this study may potentially boost the sales. The online furniture-selling company would help to make two sets of websites that contain the same products, one with real world background pictures of the product (treatment group), while the other has pure white backgrounds of the product (control group). The two websites are both real ones which means the customers can actually purchase the order through it. The only difference between the two websites are the pictures of products with different backgrounds, other than that, everything is controlled to be the same. The participants will be randomly selected for viewing one of the two websites. This study aims to collect the following data to analyze: the percentage that participants would add the products to their shopping cart and the purchase rate. The outcome will be based on the purchasing of the products within the first 48 hours from the first view. The control group in this study is the product pictures with white backgrounds, and the experimental group is the product pictures with real-world setting backgrounds.
Operational Procedures

The participants in the study will be the study group members as well as the authors of this research. Since this is an experimental research, the participants are all the consumers that enter the website and the system will randomly distribute them to link to one of the two website pages. Each participant will be randomly selected to one of the two website pages and continue their shopping just like a normal shopping experience. One of the websites is designed to have product pictures with white backgrounds. Another website is designed to have product pictures with real life backgrounds. Comparing the behavior of the two groups of the customers, we expect to find the insights whether background pictures affect customers behavior. The two websites are constructed by the startup company and we cooperate with them on a reciprocity contract. They provide the construction of the website and the data collection and we conduct the analysis of the data. The experiment will be conducted directly at the startup’s furniture online store website. On top of that, the experiment is 24 hours non stop during the experiment duration.
Brief Schedule

The whole research will be conducted in three phases. The first phase is an experiment cooperating with a startup company. The second phase is the data collection. The last phase is the analysis and the draw of the conclusion. The whole study will take five months to complete. The experiment will be started on July 15th, 2022, and end on October 15th 2022. The whole experiment will take three months to complete. This duration is discussed with the startup company and is chosen because history sales data reveals that there is a selling peak during the summer season. The whole study is estimated in three months however it depends on whether we have collected enough samples. The experiment duration would be extended until we collect the sample amount we expected. The data collection is estimated to be finished in one month, which is in the middle of November. Lastly, the final phase will be a conclusion in which we will analyze the result using relevant statistical methods. We will also address limitations and problems that occurred and directions for future research improvements.
Data Collection

When our experiment period is over, we will collect the shopping cart rate and the purchase rate of each customer in the two groups that had participated in the experiment. We use interval sampling to drive customers into one of the two websites. The customer will be equally distributed in two groups. We will then use both Excel and R to record and analyze the data.
Data Security

As we are collaborating with a startup company, we will be accessing the data stored in the company’s database which will be using asymmetric encryption to protect the privacy of the customer data.
Variables
- Outcomes(Dependent Variables)
  1. Purchase rate (Conversion rate): It is the percentage purchases made on the website. To calculate it, simply divide the number of purchases made divided by the total number of visitors of the website and then multiply by 100.
  2. Add-To-Cart Rate: It is the percentage of visitors who add at least one item to their cart in a given session. To calculate it, simply divide the total number of sessions where someone adds an item to the cart by the total number of sessions and then multiply by 100.
- Treatments (Independent Variables)
  1. Background: The background image of the product can be a real life background or a white background.

Statistical Analysis Plan

We have two research questions and will stimulate two scenarios of each. In the beginning, we will stimulate the statistics power first and then start the stimulations of the sample groups. For the two research questions, we randomly assign 3000 samples into two groups, each will have 1500 samples. The sample will be in binomial normal distribution. For research question 1, we set a proportion of 0.15 in the control groups of both the scenarios and treatment group as 0.15 and 0.25 in scenario 1 and 2 respectively. For research question 2, we set a proportion of 0.25 in the control groups of both the scenarios, as the proportion of adding to cart is higher than purchasing, and treatment group as 0.25 and 0.35 in scenario 1 and 2 respectively. We will use a proportion T test to process the stimulation. We will then conduct a repetition of the stimulation. After the simulations have been done, we will analyze the mean effects, confidence level, and the percentage of type 1 and type 2 errors. We will compare the indicators by chart to compare the difference.

Sample Size and Statistical Power

The sample size we use is 3000 and 1500 in each group. We decided this sample size to get higher statistical power. We use power test to get our statistical power. The statistical power is 0.86 with significance level 0.05.

Possible Recommendations

There are some possible recommendations related to the study. First of all, if we fail to reject the null hypothesis, which means that the pictures with real-world background don’t work better than those with white backgrounds, we would suggest to keep using the white background photo for online shopping. However, if the statistical test shows that the null hypothesis should be rejected, then we may conclude that the pictures with real-world background may generate more profits for retail companies, then we would suggest adding more real-life pictures, and possibly hire some photographers to achieve the goal.

Limitations and Uncertainties

Our experiment was designed to randomly select participants and redirect them to either the website for the control group which shows them product photos with blank backgrounds or the website for the treatment group which shows them product photos with real-life backgrounds. Although such a setting can help us know the real reactions of the company’s target customers to different backgrounds of product photos, we are not able to collect and control some characteristics of the sample such as customers’ demographic information including age, income, and marital status which can affect consumers’ purchasing behavior. Also, as each participant will be linked to only one type of website, we cannot measure the effect of different backgrounds of product photos on the click rate. Additionally, since furniture is more expensive and depreciates at a slower rate than other products, the average purchase rate of furniture would also be lower, which leads to a small effect size and therefore requires a larger sample size to have enough statistical power. As a result, collecting enough sample data would be a costly part of the research considering the budget limit of a startup company.

Part 2: Simulation Effects

library(pwr)
library(data.table)
library(DT)
library(dplyr)

## 
## Attaching package: 'dplyr'

## The following objects are masked from 'package:data.table':
## 
##     between, first, last

## The following objects are masked from 'package:stats':
## 
##     filter, lag

## The following objects are masked from 'package:base':
## 
##     intersect, setdiff, setequal, union

library(kableExtra)

## 
## Attaching package: 'kableExtra'

## The following object is masked from 'package:dplyr':
## 
##     group_rows

Simulating the statistical power

pwr.test = pwr.t2n.test(n1 = 1500, n2 = 1500, d = 0.1, sig.level = 0.05, 
    alternative = "greater")
pwr.test

## 
##      t test power calculation 
## 
##              n1 = 1500
##              n2 = 1500
##               d = 0.1
##       sig.level = 0.05
##           power = 0.8628341
##     alternative = greater

Research Question 1: Does a real world background image have a greater purchasing rate than white backgrounds?

Scenario 1: No Effect on Purchase Rate

n = 3000
set.seed(1031)

#By randomly assign 3000 into two groups, each would have 1500.
purchase_rate_S1.dat = data.table(Group = c(rep.int(x = "Treatment", times = n/2), rep.int(x = "Control", times = n/2)))

purchase_rate_S1.dat[Group == "Control", PR := round(x = rbinom(n = 1500, size = 1, prob= 0.15))]
purchase_rate_S1.dat[Group == "Treatment", PR := round(x = rbinom(n = 1500, size = 1, prob = 0.15))]
datatable(data = purchase_rate_S1.dat)

table(purchase_rate_S1.dat)

##            PR
## Group          0    1
##   Control   1282  218
##   Treatment 1286  214

#Number of people in Treatment group with Purchase Rate as 1
purchase_rate_treatment_S1 = purchase_rate_S1.dat%>%
  filter(PR==1, Group == 'Treatment')%>%
  nrow()

 
#Number of people in Control group with Purchase Rate as 1
purchase_rate_control_S1 = purchase_rate_S1.dat%>%
  filter(PR==1, Group == 'Control')%>%
  nrow()

Analysis

Applying the two sample proportion test

purchase_rate_S1 = prop.test(x = c(purchase_rate_treatment_S1,purchase_rate_control_S1), n = c(n/2, n/2),alternative = 'greater'); purchase_rate_S1

## 
##  2-sample test for equality of proportions with continuity correction
## 
## data:  c(purchase_rate_treatment_S1, purchase_rate_control_S1) out of c(n/2, n/2)
## X-squared = 0.024338, df = 1, p-value = 0.562
## alternative hypothesis: greater
## 95 percent confidence interval:
##  -0.02442018  1.00000000
## sample estimates:
##    prop 1    prop 2 
## 0.1426667 0.1453333

Function

analyze.experiment <- function(the.dat) {
    require(data.table)
    setDT(the.dat)
    
    the.test <- t.test(x = the.dat[Group == "Treatment", 
        PR], y = the.dat[Group == "Control", PR], alternative = "greater")
    
    the.effect <- the.test$estimate[1] - the.test$estimate[2]
    upper.bound <- the.test$conf.int[2]
    p <- the.test$p.value
    
    result <- data.table(effect = the.effect, upper_ci = upper.bound, 
        p = p)
    
    return(result)
}
analyze.experiment(purchase_rate_S1.dat)

##          effect upper_ci         p
## 1: -0.002666667      Inf 0.5823553

Repeat the Experiment with Simulation

B <- 1000
n <- 3000
RNGversion(vstr = 3.6)
set.seed(1031)
Experiment <- 1:B
Group <- c(rep.int(x = "Treatment", times = n/2), rep.int(x = "Control", times = n/2))

sim.dat_r1s1 <- as.data.table(expand.grid(Experiment = Experiment, Group = Group))
setorderv(x = sim.dat_r1s1, cols = c("Experiment", "Group"), order = c(1,1))
sim.dat_r1s1[Group == "Control", PR := round(x =  rbinom(n = .N, size = 1, prob= 0.15), digits = 1)]
sim.dat_r1s1[Group == "Treatment", PR := round(x = rbinom(n = .N, size = 1, prob = 0.15), digits = 1)]
dim(sim.dat_r1s1)

## [1] 3000000       3

Results

exp.results_r1s1 <- sim.dat_r1s1[, analyze.experiment(the.dat = .SD), 
    keyby = "Experiment"] 

DT::datatable(data = round(x = exp.results_r1s1[1:100, ], digits = 3), 
    rownames = F)

pvalue = mean(exp.results_r1s1$p)
table_r1s1 <- data.table(Research_Question = "Question 1",
                         Scenario = "No Effect",
                         Mean_Effect_in_Simulated_Data = mean(exp.results_r1s1$effect),
                         Ninety_Five_Percent_Confidence_Interval_of_Mean_Effect = mean(exp.results_r1s1$upper_ci),
                         Percentage_of_False_Positives = exp.results_r1s1[, mean(p < 0.05)],
                         Percentage_of_True_Negative = 1-exp.results_r1s1[, mean(p < 0.05)],
                         Percentage_of_False_Negative = "",
                         Percentage_of_True_Positives = ""
)
table_r1s1

##    Research_Question  Scenario Mean_Effect_in_Simulated_Data
## 1:        Question 1 No Effect                       0.00047
##    Ninety_Five_Percent_Confidence_Interval_of_Mean_Effect
## 1:                                                    Inf
##    Percentage_of_False_Positives Percentage_of_True_Negative
## 1:                         0.052                       0.948
##    Percentage_of_False_Negative Percentage_of_True_Positives
## 1:

Scenario 2: An Expected Effect on Purchase Rate

#Purchase Rate - > Scenario 2: An expected effect
n <- 3000
set.seed(1031)

purchase_rate_S2.dat <- data.table(Group = c(rep.int(x = "Treatment", times = n/2), rep.int(x = "Control", times = n/2)))

purchase_rate_S2.dat[Group == "Control", PR := round(x = rbinom(n = 1500, size = 1, prob= 0.15))]
purchase_rate_S2.dat[Group == "Treatment", PR := round(x = rbinom(n = 1500, size = 1, prob = 0.25))]
datatable(data = purchase_rate_S2.dat)

table(purchase_rate_S2.dat)

##            PR
## Group          0    1
##   Control   1282  218
##   Treatment 1120  380

# Number of people in Treatment group with Purchase Rate as 1
purchase_rate_treatment_S2 = purchase_rate_S2.dat%>%
  filter(PR==1, Group== 'Treatment')%>%
  nrow()

# Number of people in Control group with Purchase Rate as 1
purchase_rate_control_S2 = purchase_rate_S2.dat%>%
  filter(PR==1, Group == 'Control')%>%
  nrow()

Analysis

Applying the two sample proportion test

purchase_rate_S2 = prop.test(x = c(purchase_rate_treatment_S2 ,purchase_rate_control_S2), n = c(n/2, n/2),alternative = 'greater'); purchase_rate_S2

## 
##  2-sample test for equality of proportions with continuity correction
## 
## data:  c(purchase_rate_treatment_S2, purchase_rate_control_S2) out of c(n/2, n/2)
## X-squared = 54.138, df = 1, p-value = 9.347e-14
## alternative hypothesis: greater
## 95 percent confidence interval:
##  0.083559 1.000000
## sample estimates:
##    prop 1    prop 2 
## 0.2533333 0.1453333

Function

analyze.experiment <- function(the.dat) {
    require(data.table)
    setDT(the.dat)
    
    the.test <- t.test(x = the.dat[Group == "Treatment", 
        PR], y = the.dat[Group == "Control", PR], alternative = "greater")
    
    the.effect <- the.test$estimate[1] - the.test$estimate[2]
    upper.bound <- the.test$conf.int[2]
    p <- the.test$p.value
    
    result <- data.table(effect = the.effect, upper_ci = upper.bound, 
        p = p)
    
    return(result)
}
analyze.experiment(purchase_rate_S2.dat)

##    effect upper_ci            p
## 1:  0.108      Inf 5.304356e-14

Repeat the Experiment with Simulation

B <- 1000
n <- 3000
RNGversion(vstr = 3.6)
set.seed(1031)
Experiment <- 1:B
Group <- c(rep.int(x = "Treatment", times = n/2), rep.int(x = "Control", times = n/2))

sim.dat_r1s2 <- as.data.table(expand.grid(Experiment = Experiment, Group = Group))
setorderv(x = sim.dat_r1s2, cols = c("Experiment", "Group"), order = c(1,1))
sim.dat_r1s2[Group == "Control", PR := round(x =  rbinom(n = .N, size = 1, prob= 0.15), digits = 1)]
sim.dat_r1s2[Group == "Treatment", PR := round(x = rbinom(n = .N, size = 1, prob = 0.25), digits = 1)]
dim(sim.dat_r1s2)

## [1] 3000000       3

Results

exp.results_r1s2 <- sim.dat_r1s2[, analyze.experiment(the.dat = .SD), 
    keyby = "Experiment"] 

DT::datatable(data = round(x = exp.results_r1s2[1:100, ], digits = 3), 
    rownames = F)

pvalue = mean(exp.results_r1s1$p)
table_r1s2 <- data.table(Research_Question = "Question 1",
                         Scenario = "Expected Effect",
                         Mean_Effect_in_Simulated_Data = mean(exp.results_r1s2$effect),
                         Ninety_Five_Percent_Confidence_Interval_of_Mean_Effect = mean(exp.results_r1s2$upper_ci),
                         Percentage_of_False_Positives = "",
                         Percentage_of_True_Negative = "",
                         Percentage_of_False_Negative = 1-exp.results_r1s2[, mean(p < 0.05)],
                         Percentage_of_True_Positives = exp.results_r1s2[, mean(p < 0.05)]
)
table_r1s2

##    Research_Question        Scenario Mean_Effect_in_Simulated_Data
## 1:        Question 1 Expected Effect                     0.1002933
##    Ninety_Five_Percent_Confidence_Interval_of_Mean_Effect
## 1:                                                    Inf
##    Percentage_of_False_Positives Percentage_of_True_Negative
## 1:                                                          
##    Percentage_of_False_Negative Percentage_of_True_Positives
## 1:                            0                            1

Research Question 2: Does a real world background image have a higher rate of adding products to shopping carts than white backgrounds?

Scenario 1: No Effect on Add to Cart Rate

n <- 3000
set.seed(1031)

#By randomly assign 3000 into two groups, each would have 1500.
add_to_cart_S1.dat <- data.table(Group = c(rep.int(x = "Treatment", times = n/2), rep.int(x = "Control", times = n/2)))

add_to_cart_S1.dat[Group == "Control", ACR := round(x = rbinom(n = 1500, size = 1, prob= 0.25))]
add_to_cart_S1.dat[Group == "Treatment", ACR := round(x = rbinom(n = 1500, size = 1, prob = 0.25))]
datatable(data = add_to_cart_S1.dat)

table(add_to_cart_S1.dat)

##            ACR
## Group          0    1
##   Control   1127  373
##   Treatment 1120  380

#Number of people in Treatment group with Add to Cart Rate as 1
add_to_cart_treatment_S1 = add_to_cart_S1.dat%>%
  filter(ACR==1, Group== 'Treatment')%>%
  nrow()

#Number of people in Control group with Add to Cart Rate as 1 
add_to_cart_control_S1 = add_to_cart_S1.dat%>%
  filter(ACR==1, Group == 'Control')%>%
  nrow()

Analysis

Applying the two sample proportion test

add_to_cart_S1 = prop.test(x = c(add_to_cart_treatment_S1,add_to_cart_control_S1), n = c(n/2, n/2),alternative = 'greater'); add_to_cart_S1

## 
##  2-sample test for equality of proportions with continuity correction
## 
## data:  c(add_to_cart_treatment_S1, add_to_cart_control_S1) out of c(n/2, n/2)
## X-squared = 0.06383, df = 1, p-value = 0.4003
## alternative hypothesis: greater
## 95 percent confidence interval:
##  -0.02204163  1.00000000
## sample estimates:
##    prop 1    prop 2 
## 0.2533333 0.2486667

Function

analyze.experiment <- function(the.dat) {
    require(data.table)
    setDT(the.dat)
    
    the.test <- t.test(x = the.dat[Group == "Treatment", 
        ACR], y = the.dat[Group == "Control", ACR], alternative = "greater")
    
    the.effect <- the.test$estimate[1] - the.test$estimate[2]
    upper.bound <- the.test$conf.int[2]
    p <- the.test$p.value
    
    result <- data.table(effect = the.effect, upper_ci = upper.bound, 
        p = p)
    
    return(result)
}
analyze.experiment(add_to_cart_S1.dat)

##         effect upper_ci        p
## 1: 0.004666667      Inf 0.384137

Repeat the function with Simulation

B <- 1000
n <- 3000
RNGversion(vstr = 3.6)
set.seed(1031)
Experiment <- 1:B
Group <- c(rep.int(x = "Treatment", times = n/2), rep.int(x = "Control", times = n/2))

sim.dat_r2s1 <- as.data.table(expand.grid(Experiment = Experiment, Group = Group))
setorderv(x = sim.dat_r2s1, cols = c("Experiment", "Group"), order = c(1,1))
sim.dat_r2s1[Group == "Control", ACR := round(x =  rbinom(n = .N, size = 1, prob= 0.25), digits = 1)]
sim.dat_r2s1[Group == "Treatment", ACR := round(x = rbinom(n = .N, size = 1, prob = 0.25), digits = 1)]
dim(sim.dat_r2s1)

## [1] 3000000       3

Results

exp.results_r2s1 <- sim.dat_r2s1[, analyze.experiment(the.dat = .SD), 
    keyby = "Experiment"] 

DT::datatable(data = round(x = exp.results_r2s1[1:100, ], digits = 3), 
    rownames = F)

pvalue = mean(exp.results_r2s1$p)
table_r2s1 <- data.table(Research_Question = "Question 2",
                         Scenario = "No Effect",
                         Mean_Effect_in_Simulated_Data = mean(exp.results_r2s1$effect),
                         Ninety_Five_Percent_Confidence_Interval_of_Mean_Effect = mean(exp.results_r2s1$upper_ci),
                         Percentage_of_False_Positives = exp.results_r2s1[, mean(p < 0.05)],
                         Percentage_of_True_Negative = 1-exp.results_r2s1[, mean(p < 0.05)],
                         Percentage_of_False_Negative = "",
                         Percentage_of_True_Positives = ""
)
table_r2s1

##    Research_Question  Scenario Mean_Effect_in_Simulated_Data
## 1:        Question 2 No Effect                  0.0001926667
##    Ninety_Five_Percent_Confidence_Interval_of_Mean_Effect
## 1:                                                    Inf
##    Percentage_of_False_Positives Percentage_of_True_Negative
## 1:                          0.06                        0.94
##    Percentage_of_False_Negative Percentage_of_True_Positives
## 1:

Scenario 2: An Expected Effect on Add to Cart Rate

n <- 3000
set.seed(1031)
#By randomly assign 3000 into two groups, each would have 1500.
add_to_cart_S2.dat <- data.table(Group = c(rep.int(x = "Treatment", times = n/2), rep.int(x = "Control", times = n/2)))

add_to_cart_S2.dat[Group == "Control", ACR := round(x = rbinom(n = 1500, size = 1, prob= 0.25))]
add_to_cart_S2.dat[Group == "Treatment", ACR := round(x = rbinom(n = 1500, size = 1, prob = 0.35))]
datatable(data = add_to_cart_S2.dat)

table(add_to_cart_S2.dat)

##            ACR
## Group          0    1
##   Control   1127  373
##   Treatment  972  528

#Number of people in Treatment group with Add to Cart Rate as 1
add_to_cart_treatment_S2 = add_to_cart_S2.dat%>%
  filter(ACR==1, Group== 'Treatment')%>%
  nrow()

#Number of people in Control group with Add to Cart Rate as 1
add_to_cart_control_S2 = add_to_cart_S2.dat%>%
  filter(ACR==1, Group == 'Control')%>%
  nrow()

Analysis

Applying the two sample proportion test

add_to_cart_S2 = prop.test(x = c(add_to_cart_treatment_S2 ,add_to_cart_control_S2), n = c(n/2, n/2),alternative = 'greater');add_to_cart_S2

## 
##  2-sample test for equality of proportions with continuity correction
## 
## data:  c(add_to_cart_treatment_S2, add_to_cart_control_S2) out of c(n/2, n/2)
## X-squared = 37.621, df = 1, p-value = 4.297e-10
## alternative hypothesis: greater
## 95 percent confidence interval:
##  0.07530971 1.00000000
## sample estimates:
##    prop 1    prop 2 
## 0.3520000 0.2486667

Function

analyze.experiment <- function(the.dat) {
    require(data.table)
    setDT(the.dat)
    
    the.test <- t.test(x = the.dat[Group == "Treatment", 
        ACR], y = the.dat[Group == "Control", ACR], alternative = "greater")
    
    the.effect <- the.test$estimate[1] - the.test$estimate[2]
    upper.bound <- the.test$conf.int[2]
    p <- the.test$p.value
    
    result <- data.table(effect = the.effect, upper_ci = upper.bound, 
        p = p)
    
    return(result)
}
analyze.experiment(add_to_cart_S2.dat)

##       effect upper_ci            p
## 1: 0.1033333      Inf 3.001506e-10

Repeat the function with Simulation

B <- 1000
n <- 3000
RNGversion(vstr = 3.6)
set.seed(1031)
Experiment <- 1:B
Group <- c(rep.int(x = "Treatment", times = n/2), rep.int(x = "Control", times = n/2))

sim.dat_r2s2 <- as.data.table(expand.grid(Experiment = Experiment, Group = Group))
setorderv(x = sim.dat_r2s2, cols = c("Experiment", "Group"), order = c(1,1))
sim.dat_r2s2[Group == "Control", ACR := round(x =  rbinom(n = .N, size = 1, prob= 0.25), digits = 1)]
sim.dat_r2s2[Group == "Treatment", ACR := round(x = rbinom(n = .N, size = 1, prob = 0.35), digits = 1)]
dim(sim.dat_r2s2)

## [1] 3000000       3

Results

exp.results_r2s2 <- sim.dat_r2s2[, analyze.experiment(the.dat = .SD), 
    keyby = "Experiment"] 

DT::datatable(data = round(x = exp.results_r2s2[1:100, ], digits = 3), 
    rownames = F)

pvalue = mean(exp.results_r2s2$p)
table_r2s2 <- data.table(Research_Question = "Question 2",
                         Scenario = "Expected Effect",
                         Mean_Effect_in_Simulated_Data = mean(exp.results_r2s2$effect),
                         Ninety_Five_Percent_Confidence_Interval_of_Mean_Effect = mean(exp.results_r2s2$upper_ci),
                         Percentage_of_False_Positives = "",
                         Percentage_of_True_Negative = "",
                         Percentage_of_False_Negative = 1-exp.results_r2s2[, mean(p < 0.05)],
                         Percentage_of_True_Positives = exp.results_r2s2[, mean(p < 0.05)]
)
table_r2s2

##    Research_Question        Scenario Mean_Effect_in_Simulated_Data
## 1:        Question 2 Expected Effect                    0.09999867
##    Ninety_Five_Percent_Confidence_Interval_of_Mean_Effect
## 1:                                                    Inf
##    Percentage_of_False_Positives Percentage_of_True_Negative
## 1:                                                          
##    Percentage_of_False_Negative Percentage_of_True_Positives
## 1:                            0                            1

Results Table

Results = rbind(table_r1s1,table_r1s2,table_r2s1,table_r2s2);
Results %>%
  kbl() %>%
  kable_styling()

Research_Question	Scenario	Mean_Effect_in_Simulated_Data	Ninety_Five_Percent_Confidence_Interval_of_Mean_Effect	Percentage_of_False_Positives	Percentage_of_True_Negative	Percentage_of_False_Negative	Percentage_of_True_Positives
Question 1	No Effect	0.0004700	Inf	0.052	0.948
Question 1	Expected Effect	0.1002933	Inf			0	1
Question 2	No Effect	0.0001927	Inf	0.06	0.94
Question 2	Expected Effect	0.0999987	Inf			0	1

References

Brewster, M. (2022, October 3). Annual retail trade survey shows impact of online shopping on retail sales during COVID-19 pandemic. Census.gov. https://www.census.gov/library/stories/2022/04/ecommerce-sales-surged-during-pandemic.html
Gonchigjav, B. (2020). Results of neuromarketing study of visual attention and emotions of buyers in retail store environment. Proceedings of the Mongolian Academy of Sciences, 52–64. https://doi.org/10.5564/pmas.v60i1.1337
Goswami, A., Chittar, N., & Sung, C. (2011). A study on the impact of product images on user clicks for online shopping. Proceedings of the 20th International Conference Companion on the World Wide Web, 45–46. https://doi.org/10.1145/1963192.1963216
Huang, J., Wang, Z., Liu, H., & Yu, L. (2020). Similar or contrastive? Impact of product–background color combination on consumers’ product evaluations. Psychology & Marketing, 37(7), 961–979. https://doi.org/10.1002/mar.21361
Jeon, M., & Yoh, E. (2014). Effect of Sensibility Responses on Backgrounds of Product Photos on Consumer Attitude of Online Shopping Malls. Fashion Business, 18(2), 29–41. https://doi.org/10.12940/jfb.2014.18.2.29
Kim, S. Y., Baek, G. Y., Choi, J. E., & Lee, H.-H. (2014). The Effects of Product Presentation and Background of Photos in Internet Shopping Malls on Consumer Perceptions. Journal of the Korean Society of Clothing and Textiles, 38(4), 467–481. https://doi.org/10.5850/JKSCT.2014.38.4.467
Stop doing it wrong. the importance of product photos in e-commerce. (2022, August 18). Ergonode. Retrieved October 16, 2022, from https://www.ergonode.com/blog/the-importance-of-product-photos-in-e-commerce
ReportLink. (2022, March). E-Commerce Furniture Market by Type, by Product Type, by Material Type, by End Use, and by Price Range – Global Opportunity Analysis and Industry Forecast 2022-2030. https://www.reportlinker.com/p06272294/E-Commerce-Furniture-Market-by-Type-by-Product-Type-by-Material-Type-by-End-Use-and-by-Price-Range-Global-Opportunity-Analysis-and-Industry-Forecast.html?utm_source=GNW
Szász, L., Bálint, C., Csíki, O., Nagy, B. Z., Rácz, B.-G., Csala, D., & Harris, L. C. (2022). The impact of COVID-19 on the evolution of online retail: The pandemic as a window of opportunity. Journal of Retailing and Consumer Services, 69, 103089–103089. https://doi.org/10.1016/j.jretconser.2022.103089
Published by Statista Research Department, & 4, J. (2022, January 4). US: Furniture e-retail revenue 2017-2025. Statista. Retrieved December 4, 2022, from https://www.statista.com/statistics/257524/us-furniture-and-home-furnishings-e-commerce-revenue/

The Effects of Real Life Background of Online Product Photos on Consumer Behavior.

Part 1: Research Proposal

Executive Summary / Abstract

Statement of the Problem

Research Questions, Hypotheses, and Effects

Literature Review

Research Plan

Statistical Analysis Plan

Sample Size and Statistical Power

Possible Recommendations

Limitations and Uncertainties

Part 2: Simulation Effects

Simulating the statistical power

Research Question 1: Does a real world background image have a greater purchasing rate than white backgrounds?

Scenario 1: No Effect on Purchase Rate

Analysis

Function

Repeat the Experiment with Simulation

Results

Scenario 2: An Expected Effect on Purchase Rate

Analysis

Function

Repeat the Experiment with Simulation

Results

Research Question 2: Does a real world background image have a higher rate of adding products to shopping carts than white backgrounds?

Scenario 1: No Effect on Add to Cart Rate

Analysis

Function

Repeat the function with Simulation

Results

Scenario 2: An Expected Effect on Add to Cart Rate

Analysis

Function

Repeat the function with Simulation

Results

Results Table

References