2.99 See Answer

Question: Starbucks Coffee Co. uses a data-based


Starbucks Coffee Co. uses a data-based approach for improving the quality and customer satisfaction of its products. When survey data indicated that Starbucks needed to improve its package sealing process, an experiment was conducted to determine the factors in the bag-sealing equipment that might be affecting the ease of opening the bag without tearing the inner liner of the bag.
Source: Data extracted from L. Johnson and S. Burrows, “For Starbucks, It’s in the Bag,” Quality Progress, March 2011, pp. 17–23.
Among the factors that could affect the rating of the ability of the bag to resist tears were the viscosity, pressure, and plate gap on the bag-sealing equipment. Data were collected on 19 bags in which the plate gap was varied and the results were stored in Starbucks .
a. Using all the data as the training sample, develop a regression tree model to predict the rating of the ability of the bag to resist tears.
b. What conclusions can you reach about the rating of the ability of the bag to resist tears?


> A market research study has been conducted by a travel website that specializes in restaurants with the business objective to determine which food cuisines are perceived to be similar and which are perceived to be different. The following cuisine types w

> Using the yearly amount of solar power generated by utilities (in millions of kWh) in the United States from 2002 through 2016 data for Problem 16.16 on page 645 (stored in SolarPower ), a. fit a third-order autoregressive model to the amount of solar p

> Using the average baseball salary from 2000 through 2017 data for Problem 16.18 on page 645 (stored in BBSalaries ), a. fit a third-order autoregressive model to the average baseball salary and test for the significance of the third-order autoregressive

> Using the data for Problem 16.17 on page 645 concerning the number of passenger cars produced in the United States from 1999 to 2016 (stored in CarProduction ), a. fit a third-order autoregressive model to the number of passenger cars produced in the Uni

> Using the data for Problem 16.12 on page 645 concerning the bonuses paid to workers on Wall Street from 2000 to 2016 (stored in Bonuses ), a. fit a third-order autoregressive model to the bonuses paid and test for the significance of the third-order auto

> Using the data for Problem 16.15 on page 645 that represent the number of new, single-family houses sold in the U.S. from 1992 through 2016 (stored in HouseSales ), a. fit a third-order autoregressive model to the new single-family homes sold and test fo

> Refer to Problem 16.24. Suppose, when testing for the appropriateness of the fitted model, the standard errors are Sa1 = 0.45 Sa2 = 0.35 Sa3 = 0.15 a. What conclusions can you reach? b. Discuss how to proceed if forecasting is still your main objective.

> How does multiple correspondence analysis differ from multidimensional scaling?

> In Problem 20.16, an author is deciding which of two competing publishing companies to select to publish her new novel. Prior to making a final decision, the author decides to have an experienced reviewer examine her novel. This reviewer has an outstandi

> How do classification trees differ from regression trees?

> What is the difference between supervised and unsupervised analytics methods?

> Have you wondered how Internet connection speed varies around the globe? The file ConnectionSpeed contains the mean connection speed, the mean peak connection speed, the percent of the time the connection speed is above 4 mbps, and the percent of the tim

> The file MobileSpeed contains the overall download and upload speeds in mbps for nine carriers in the United States. Source: Data extracted from “Best Mobile Network 2016,” bit.ly/1KGPrMm, accessed November 10, 2016. a. Perform a multidimensional scaling

> A Pew Research Center survey found that social networking is popular in many nations around the world. The file GlobalSocialMedia contains the level of social media networking (measured as the percent of individuals polled who use social networking sites

> The file Protein contains calorie and cholesterol information for popular protein foods (fresh red meats, poultry, and fish) compiled by the U.S. Department of Agriculture. a. Perform a multidimensional scaling analysis on the protein foods based on the

> The file Cereals contains the calories, carbohydrates, and sugar, in grams, in one serving of seven breakfast cereals. a. Perform a multidimensional scaling analysis on the cereals based on the calories, carbohydrates, and sugar in grams. b. What conclus

> Movie companies need to predict the gross receipts of individual movies once the movie has debuted. The following results, stored in PotterMovies , are the first weekend gross, the U.S. gross, and the worldwide gross (in $millions) of the Harry Potter mo

> The file Social Response contains the product category, sentiment rating, and customer type and frequency of posting (low, average, high) for 300 recently posted comments to a retailer’s community website. a. Conduct a multiple correspondence analysis of

> The file HybridSales contains the number of domestic and imported hybrid vehicles sold in the United States from 1999 to 2016. Source: Data extracted from Oak Ridge National Laboratory, “Vehicle Technologies Market Report,” bit.ly/2xrcrtO. You want to be

> In Problem 20.14, an investor is trying to determine the optimal investment decision among three investment opportunities. Prior to making his investment decision, the investor decides to consult with his financial adviser. In the past, when the economy

> A survey was conducted on the characteristics of households in the United States. The data (which have been altered from an actual study to preserve the anonymity of the respondents) are stored in Households . The variables are gender, age, Hispanic orig

> A mining company operates a large heap-leach gold mine in the western United States. The gold mined at this location consists of ore that is very low grade, having about 0.0032 ounce of gold in 1 ton of ore. The process of heap-leaching involves the mini

> The data in the file BankMarketing are from a direct marketing campaign conducted by a Portuguese banking institution. Source: Data extracted from S. Moro, R. Laureano, and P. Cortez, “Using Data Mining for Bank Direct Marketing: An Application of the CR

> Zagat’s publishes restaurant ratings for various locations in the United States. The file Restaurants2 contains the Zagat rating for food, décor, service, cost per person, and popularity index (popularity points the restaurant received divided by the num

> A study was conducted to determine whether any gender bias existed in an academic science environment. Faculty from several universities were asked to rate candidates for the position of undergraduate laboratory manager based on their application. The ge

> The file UsedCars contains attributes of cars that are currently part of an inventory of a used car dealership. The variables included are car, year, age, price ($), mileage, power (hp), and fuel (mpg). Source: Data extracted from www.truecar.com/used-ca

> Professional basketball has truly become a sport that generates interest among fans around the world. More and more players come from outside the United States to play in the National Basketball Association (NBA). Many factors could impact the number of

> The file Philly contains a sample of 25 neighborhoods in Philadelphia. Variables included are neighborhood population, median sales price of homes in the second quarter of 2017, mean number of days homes were on the market in the second quarter of 2017,

> The file EuroTourism2 contains a sample of 28 European countries. Variables included are the number of jobs generated in the travel and tourism industry in 2015, the spending on business travel within the country by residents and international visitors i

> Repeat Problem 17.2 for the Cincinnati Reds. Problem 17.2: Many factors determine the attendance at Major League Baseball games. These factors can include when the game is played, the weather, the opponent, whether the team is having a good season, and

> In Problem 20.12, a vendor at a baseball stadium is deciding whether to sell ice cream or soft drinks at today’s game. Prior to making her decision, she decides to listen to the local weather forecast. In the past, when it has been cool, the weather repo

> Repeat Problem 17.2 for the Chicago Cubs. Problem 17.2: Many factors determine the attendance at Major League Baseball games. These factors can include when the game is played, the weather, the opponent, whether the team is having a good season, and whe

> Repeat Problem 17.2 for the Philadelphia Phillies. Problem 17.2: Many factors determine the attendance at Major League Baseball games. These factors can include when the game is played, the weather, the opponent, whether the team is having a good season

> Many factors determine the attendance at Major League Baseball games. These factors can include when the game is played, the weather, the opponent, whether the team is having a good season, and whether a marketing promotion is held. Popular promotions du

> In many manufacturing processes, the term work-in-process (often abbreviated WIP) is used. At the LSS Publishing book manufacturing plants, WIP represents the time it takes for sheets from a press to be folded, gathered, sewn, tipped on end sheets, and b

> The restaurant owner in Problem 2.91 continues to learn more about the weekend patterns of patron demand. For each patron, the owner has collected and stored in Patrons the gender, the entrée ordered, the dessert ordered, and payment method. a. Conduct a

> Have you wondered how Internet connection speed varies around the globe? The file ConnectionSpeed contains the mea connection speed, the mean peak connection speed, the percent of the time the connection speed is above 4 mbps, and the percent of the time

> The file MobileSpeed contains the overall download and upload speeds in mbps for nine carriers in the United States. Source: Data extracted from “Best Mobile Network 2016,” bit.ly/1KGPrMm, accessed November 10, 2016. a. Perform a cluster analysis using t

> A Pew Research Center survey found that social networking is popular in many nations around the world. The file GlobalSocialMedia contains the level of social media networking (measured as the percent of individuals polled who use social networking sites

> The file Protein contains calorie and cholesterol information for popular protein foods (fresh red meats, poultry, and fish) compiled by the U.S. Department of Agriculture. a. Perform a cluster analysis using the complete linkage method on the protein f

> The file Cereals contains the calories, carbohydrates, and sugar, in grams, in one serving of seven breakfast cereals. a. Perform a cluster analysis using the complete linkage method on the cereals based on the calories, carbohydrates, and sugar in gram

> Consider the following payoff table: For this problem, P(E1) = 0.8, P(E2) = 0.1, P(E3) = 0.1, P(F | E1) = 0.2, P(F | E2) = 0.4, and P(F | E3) = 0.4. Suppose you are informed that event F occurs. a. Revise the probabilities P(E1), P(E2), and P(E3) now

> Movie companies need to predict the gross receipts of individual movies once the movie has debuted. The following results, stored in PotterMovies , are the first weekend gross, the U.S. gross, and the worldwide gross (in $millions) of the Harry Potter mo

> Undergraduate students at Miami University in Oxford, Ohio, were surveyed in order to evaluate the effect of price on the purchase of a pizza from Pizza Hut. The students were asked to suppose that they were going to have a large two-topping pizza delive

> An automotive insurance company wants to predict which filed stolen vehicle claims are fraudulent, based on the number of claims submitted per year by the policy holder and whether the policy is a new policy, that is, is one year old or less (coded as 1

> A marketing manager wants to predict customers with risk of churning (switching their service contracts to another company) based on the number of calls the customer makes to the company call center and the number of visits the customer makes to the loca

> A hotel has designed a new system for room service delivery of breakfast that allows the customer to select a specific delivery time. The file Satisfaction contains the difference between the actual and requested delivery times (a negative time means tha

> The owner of a moving company typically has his most experienced manager predict the total number of labor hours that will be required to complete an upcoming move. This approach has proved useful in the past, but the owner has the business objective of

> In mining engineering, holes are often drilled through rock using drill bits. As a drill hole gets deeper, additional rods are added to the drill bit to enable additional drilling to take place. It is expected that drilling time increases with depth. Thi

> The business problem facing a consumer products company is to measure the effectiveness of different types of advertising media in the promotion of its products. Specifically, the company is interested in the effectiveness of radio advertising and newspa

> Using the bonuses paid to workers on Wall Street data for Problem 16.12 on page 645 and Problem 16.28 on page 655 (stored in Bonuses ), a. perform a residual analysis for each model. b. compute the standard error of the estimate (SYX) for each model. c.

> Consider the following payoff table: For this problem, P(E1) = 0.5, P(E2) = 0.5, P(F | E1) = 0.6, and P(F | E2) = 0.4. Suppose that you are informed that event F occurs. a. Revise the probabilities P(E1) and P(E2) now that you know that event F has occ

> Using the new, single-family house sales data for Problem 16.15 on page 645 and Problem 16.27 on page 654 (stored in HouseSales ), a. perform a residual analysis for each model. b. compute the standard error of the estimate (SYX) for each model. c. compu

> Using the yearly amount of solar power generated by utilities (in millions of kWh) in the United States data for Problem 16.16 on page 645 and Problem 16.31 on page 655 (stored in SolarPower), a. perform a residual analysis. b. compute the standard error

> Refer to Problem 16.32. Suppose the first residual is 12.0 (instead of 2.0) and the last residual is -11.0 (instead of -1.0). a. Compute SYX and interpret your findings Compute the MAD and interpret your findings. Problem 16.32: The following residuals

> The following residuals are from a linear trend model used to forecast sales: 2.0 -0.5 1.5 1.0 0.0 1.0 -3.0 1.5 -4.5 2.0 0.0 -1.0 a. Compute SYX and interpret your findings. b. Compute the MAD and interpret your findings.

> Refer to Problem 16.24. The three most recent values are Y15 = 23 Y16 = 28 Y17 = 34 Forecast the values for the next year and the following year. Problem 16.24: A third-order autoregressive model is fitted to an annual time series with 17 values and h

> A third-order autoregressive model is fitted to an annual time series with 17 values and has the following estimated parameters and standard errors: At the 0.05 level of significance, test the appropriateness of the fitted model.

> You are given an annual time series with 40 consecutive values and asked to fit a fifth-order autoregressive model. a. How many comparisons are lost in developing the autoregressive model? b. How many parameters do you need to estimate? c. Which of the o

> A time-series plot often helps you determine the appropriate model to use. For this problem, use each of the time series presented in the following table and stored in TSModel2 : a. Plot the observed data Y over time X and plot the logarithm of the obs

> Although you should not expect a perfectly fitting model for any time-series data, you can consider the first differences, second differences, and percentage differences for a given series as guides in choosing an appropriate model. For this problem, u

> The data in CPI-U reflect the annual values of the consumer price index (CPI) in the United States over the 52-year period 1965 through 2016, using 1982 through 1986 as the base period. This index measures the average change in prices over time in a fixe

> In Problem 20.5, you developed a payoff table for whether to purchase 100, 200, 500, or 1,000 Christmas trees. Given the results of that problem, suppose that the probabilities of the demand for the different number of trees are as follows: a. Determin

> The file Silver contains the following prices in London for an ounce of silver (in US$) on the last day of the year from 1999 to 2016: a. Plot the data. b. Compute a linear trend forecasting equation and plot the trend line. c. Compute a quadratic tren

> The average salary of Major League Baseball players on opening day from 2000 to 2017 is stored in BBSalaries and shown below. a. Plot the data. b. Compute a linear trend forecasting equation and plot the trend line. c. Compute a quadratic trend forecas

> The file CarProduction contains the number of passenger cars produced in the U.S. (in thousands) from 1999 to 2016. Source: Data extracted from www.statista.com. a. Plot the data. b. Compute a linear trend forecasting equation and plot the trend line. c.

> The data shown in the following table and stored in Solar Power represent the yearly amount of solar power generated by utilities (in millions of kWh) in the United States from 2002 through 2016: a. Plot the data. b. Compute a linear trend forecasting

> The file HouseSales contains the number of new, single-family houses sold in the U.S. from 1992 through 2016. a. Plot the data. b. Compute a linear trend forecasting equation and plot the trend line. c. Compute a quadratic trend forecasting equation and

> The data in FedReceipt represent federal receipts from 1978 through 2016, in billions of current dollars, from individual and corporate income tax, social insurance, excise tax, estate and gift tax, customs duties, and federal reserve deposits. Source: D

> Gross domestic product (GDP) is a major indicator of a nation’s overall economic activity. It consists of personal consumption expenditures, gross domestic investment, net exports of goods and services, and government consumption expenditures. The file G

> There has been much publicity about bonuses paid to workers on Wall Street. Just how large are these bonuses? The file Bonuses contains the bonuses paid (in $000) from 2000 to 2016. Source: Data extracted from J. Spector, “Wall Street bonuses rise 1% to

> The linear trend forecasting equation for an annual time series containing 42 values (from 1976 to 2017) on net sales (in $billions) is a. Interpret the Y intercept, b0. b. Interpret the slope, b1. c. What is the fitted trend value for the tenth year?

> The linear trend forecasting equation for an annual time series containing 22 values (from 1996 to 2017) on total revenues (in $millions) is a. Interpret the Y intercept, b0. b. Interpret the slope, b1. c. What is the fitted trend value for the fifth y

> In Problem 20.4, you developed a payoff table to assist an author in choosing between signing with company A or with company B. Given the results computed in that problem, suppose that the probabilities of the levels of demand for the novel are as follow

> If you are using the method of least squares for fitting trends in an annual time series containing 25 consecutive yearly values, a. what coded value do you assign to X for the first year in the series? b. what coded value do you assign to X for the fift

> The file IPOs contains the number of initial public offerings (IPOs) issued from 2001 through 2016. Source: Data extracted from K.W. Hanley, “The Economics of Primary Markets,” available at bit.ly/2vWb6hv. a. Plot the data. b. Fit a three-year moving ave

> The data (stored in CoffeeExports ) represent the coffee exports (in thousands of 60 kg bags) by Costa Rica from 2004 to 2016: a. Plot the data. b. Fit a three-year moving average to the data and plot the results. c. Using a smoothing coefficient of W =

> How have stocks performed in the past? The following table presents the data stored in Stock Performance , which show the performance of a broad measure of stock performance (by percentage) for each decade from the 1830s through the 2000s: a. Plot the

> The following data, stored in CoreAppliances provide the total number of shipments of core major household appliances in the U.S. from 2000 to 2016 (in millions). Source: Data extracted from www.statistica.com. a. Plot the time series. b. Fit a three-y

> The data below (stored in DesktopLaptop ) represent the hours per day spent by American desktop/ laptop users from 2008 to 2016. Source: Data extracted from M. Meeker, Internet Trends 2017-Code Conference, available at bit.ly/2vW8Nej. a. Plot the time

> You are using exponential smoothing on an annual time series concerning total revenues (in $millions). You decide to use a smoothing coefficient of W = 0.20, and the exponentially smoothed value for 2017 is E2017 = (0.20)(12.1) + (0.80)(9.4). a. What is

> Consider a nine-year moving average used to smooth a time series that was first recorded in 1984. a. Which year serves as the first centered value in the smoothed series? b. How many years of values in the series are lost when computing all the nine-year

> If you are using exponential smoothing for forecasting an annual time series of revenues, what is your forecast for next year if the smoothed value for this year is $32.4 million?

> In Problems 15.32–15.36 you developed multiple regression models to predict the fair market value of houses in Glen Cove, Roslyn, and Freeport. Now write a report based on the models you developed. Append all appropriate charts and statistical informatio

> In Problem 20.3, you developed a payoff table for building a small factory or a large factory for manufacturing designer jeans. Given the results of that problem, suppose that the probabilities of the demand are as follows: a. Determine the optimal act

> For the following payoff table, the probability of event 1 is 0.5, and the probability of event 2 is also 0.5: a. Determine the optimal action based on the maximax criterion. b. Determine the optimal action based on the maximin criterion. c. Compute th

> The random variable is the number of nonconforming solder connections on a printed circuit board with 1000 connections.

> Actual lengths of stay at a hospital’s emergency department in 2009 are shown in the following table (rounded to the nearest hour). Length of stay is the total of wait and service times. Some longer stays are also approximated as 15 hou

> The distribution of the time until a Web site changes is important to Web crawlers that search engines use to maintain current information about Web sites. The distribution of the time until change (in days) of a Web site is approximated in the following

> Consider the visits that result in leave without being seen (LWBS) at an emergency department in Example 2.6. Assume that people independently arrive for service at hospital l. a. What is the probability that the fifth visit is the first one to LWBS? b.

> Suppose that lesions are present at 5 sites among 50 in a patient. A biopsy selects 8 sites randomly (without replacement). a. What is the probability that lesions are present in at least one selected site? b. What is the probability that lesions are pre

> A utility company might offer electrical rates based on time-of-day consumption to decrease the peak demand in a day. Enough customers need to accept the plan for it to be successful. Suppose that among 50 major customers, 15 would accept the plan. The u

> Suppose that a healthcare provider selects 20 patients randomly (without replacement) from among 500 to evaluate adherence to a medication schedule. Suppose that 10% of the 500 patients fail to adhere with the schedule. Determine the following: a. Probab

> a. For Exercise 3.7.1, calculate P(X = 1) and P(X = 4), assuming that X has a binomial distribution, and compare these results to results derived from the hypergeometric distribution. b. Use the binomial approximation to the hypergeometric distribution t

> A slitter assembly contains 48 blades. Five blades are selected at random and evaluated each day for sharpness. If any dull blade is found, the assembly is replaced with a newly sharpened set of blades. a. If 10 of the blades in an assembly are dull, wha

> A state runs a lottery in which six numbers are randomly selected from 40 without replacement. A player chooses six numbers before the state’s sample is selected. a. What is the probability that the six numbers chosen by a player match all six numbers in

> The number of surface flaws in plastic panels used in the interior of automobiles has a Poisson distribution with a mean of 0.05 flaw per square foot of plastic panel. Assume that an automobile interior contains 10 square feet of plastic panel. a. What i

> The analysis of results from a leaf transmutation experiment (turning a leaf into a petal) is summarized by the type of transformation completed: A naturalist randomly selects three leaves from this set without replacement. Determine the following proba

> Printed circuit cards are placed in a functional test after being populated with semiconductor chips. A lot contains 140 cards, and 20 are selected without replacement for functional testing. a. If 20 cards are defective, what is the probability that at

> A research study uses 800 men under the age of 55. Suppose that 30% carry a marker on the male chromosome that indicates an increased risk for high blood pressure. a. If 10 men are selected randomly and tested for the marker, what is the probability that

2.99

See Answer