Скачать презентацию DECISION MODELING WITH MICROSOFT EXCEL Chapter 13 Part Скачать презентацию DECISION MODELING WITH MICROSOFT EXCEL Chapter 13 Part

777dab0a789c6af11a2397a44dea04be.ppt

  • Количество слайдов: 52

DECISION MODELING WITH MICROSOFT EXCEL Chapter 13 Part 1 Copyright 2001 Prentice Hall Publishers DECISION MODELING WITH MICROSOFT EXCEL Chapter 13 Part 1 Copyright 2001 Prentice Hall Publishers and Ardith E. Baker

Many important decisions made by individuals and organizations crucially depend on an assessment of Many important decisions made by individuals and organizations crucially depend on an assessment of the_____. There a few “____” sayings that illustrate the promise and frustration of forecasting: “It is difficult to_____, especially in regards to the future. ” “It isn’t difficult to forecast, just to forecast ______. ” “_______, if tortured enough, will confess to just about anything. ”

Economic forecasts_____ Government policies and business decisions Insurance companies’ ______decisions in mortgages and bonds Economic forecasts_____ Government policies and business decisions Insurance companies’ ______decisions in mortgages and bonds Service industries’ (such as airlines, hotels, rental cars, cruise lines, etc. ) forecasts of _______as input for revenue management Forecasting is playing an increasingly important role in the________. There is clearly a steady _____in the use of quantitative forecasting models at many levels in industry and government. The many types of forecasting models will be distributed into two major techniques: ______and______

______forecasting models possess two important and attractive features: 1. They are expressed in mathematical ______forecasting models possess two important and attractive features: 1. They are expressed in mathematical ____. Thus, they establish an unambiguous record of how the forecast is made. 2. With the use of ________and computers, quantitative models can be based on an amazing quantity of data. Two types of quantitative forecasting models that will be discussed in the next two sections are: ____models and _____models

In a _______forecasting model, the forecast for the quantity of interest “rides piggyback” on In a _______forecasting model, the forecast for the quantity of interest “rides piggyback” on another quantity or set of quantities. In other words, our ____of the value of one variable (or perhaps several variables) enables us to forecast the value of another______. In this model, let y denote the _____of some variable of interest and ^ denote a predicted or _____value for y that variable.

Then, in a causal model, ^ = f (x , … x ) y Then, in a causal model, ^ = f (x , … x ) y 1 2 n where f is a forecasting____, or function, and x 1, x 2 , … xi , is a set of variables In this representation, the x variables are often called _____variables, whereas y ^ the is dependent or _____variable. We either _______the independent variables in advance or can forecast them more easily than ^ y. Then the independent variables will be used in the forecasting model to forecast the _____ variable.

Companies often find by looking at past _____that their monthly sales are directly related Companies often find by looking at past _____that their monthly sales are directly related to the monthly______, and thus figure that a good forecast could be made using next month’s GDP figure. The only problem is that this quantity is not _______, or it may just be a forecast and thus not a truly independent______. To use a causal forecasting model, requires two conditions: 1. There must be a ______between values of the independent and dependent variables such that the former provides ______about the latter.

2. The _______for the independent variables must be known and available to the forecaster 2. The _______for the independent variables must be known and available to the forecaster at the ____the forecast is made. Simply because there is a mathematical relationship does not ______that there is really cause and effect. One commonly used approach in creating a causal forecasting model is called_______. CURVE FITTING: AN OIL COMPANY EXPANSION Consider an oil company that is planning to expand its _____of modern self-service gasoline stations.

The company plans to use _____(measured in the average number of cars per hour) The company plans to use _____(measured in the average number of cars per hour) to forecast ______(measured in average dollar sales per hour). The firm has had five stations in operation for more than a year and has used _____data to calculate the following averages:

The averages are plotted in a scatter diagram. The averages are plotted in a scatter diagram.

Now, these data will be used to construct a _____that will be used to Now, these data will be used to construct a _____that will be used to forecast sales at any proposed location by measuring the traffic flow at that ____and plugging its value into the constructed function. Least Squares Fits The method of _____is a formal procedure for curve fitting. It is a twostep process. 1. Select a specific functional form (e. g. , a ______or quadratic curve). 2. Within the set of functions specified in step 1, choose the specific function that _____the sum of the squared deviations between the data points and the function______.

To demonstrate the process, consider the salestraffic flow example. 1. Assume a _______line; that To demonstrate the process, consider the salestraffic flow example. 1. Assume a _______line; that is, functions of the form y = a + bx. 2. Draw the line in the ______and indicate the _____between observed points and the function as di. For example, d 1 = y 1 – [a +bx 1] = 220 – [a + 150 b] where y 1 = actual sales/hr at location 1 x 1 = actual traffic flow at location 1 a = y-axis intercept for the function b = slope for the function

y d 3 d 5 d 1 y = a + bx d 4 y d 3 d 5 d 1 y = a + bx d 4 d 2 x The value d 12 is one measure of _____the value of the function [a +bx 1] is to the ____ value, y 1; that is it indicates how well the function fits at this one point.

One measure of how well the function fits overall is the sum of the One measure of how well the function fits overall is the sum of the _________: 5 S di 2 i=1 Consider a ____model with n as opposed to five______. Since each di = yi – (a +bxi), the sum of the squared deviations can be written as: n (yi – [a +bxi])2 S i=1 Using the method of_____, select a and b so as to minimize the sum in the equation above.

Now, take the _____derivative of the sum with respect to a and set the Now, take the _____derivative of the sum with respect to a and set the resulting expression equal to______. n S -2(yi – [a +bxi]) = 0 i=1 A second _____is derived by following the same procedure with b. n S -2 xi (yi – [a +bxi]) = 0 i=1 Recall that the values for xi and yi are the _______, and our goal is to find the values of a and b that satisfy these two equations.

The solution is: n b= xiy i - 1 S n i=1 n n The solution is: n b= xiy i - 1 S n i=1 n n n S xi S y i i=1 n xi 2 - 1 S n i=1 S xi i=1 n 2 n 1 Sy - b 1 S x a= n i=1 i i=1 The next step is to determine the values for: n S xi i=1 n S xi 2 i=1 n S yi i=1 n S xiy i i=1 Note that these _______depend only on observed data and can be found with simple arithmetic ______or automatically using Excel’s predefined______.

Using Excel, click on Tools – Data Analysis … In the resulting dialog, choose Using Excel, click on Tools – Data Analysis … In the resulting dialog, choose Regression.

In the _____dialog, enter the Y-range and X-range. Choose to place the _______in a In the _____dialog, enter the Y-range and X-range. Choose to place the _______in a new worksheet called Results Select ______and Normal Probability Plots to be created along with the output.

Click OK to produce the following results: Note that a (Intercept) and b (X Click OK to produce the following results: Note that a (Intercept) and b (X Variable 1) are reported as 57. 104 and 0. 92997, respectively.

To add the resulting ______line, first click on the worksheet Chart 1 which contains To add the resulting ______line, first click on the worksheet Chart 1 which contains the original_______. Next, click on the ______so that they are highlighted and then choose Add Trendline … from the Chart pull-down menu.

Choose Linear Trend in the resulting dialog and click OK. Choose Linear Trend in the resulting dialog and click OK.

A linear trend is fit to the data: A linear trend is fit to the data:

One of the other _____output values that is given in Excel is: R Square One of the other _____output values that is given in Excel is: R Square = 69. 4% This is a “_____” measure which represents the R 2 statistic discussed in introductory statistics classes. R 2 ranges in value from _____and gives an indication of how much of the total ____in Y from its mean is explained by the new trend line. In fact, there are three different sums of errors: TSS (____Sum of Squares) ESS (____Sum of Squares) RSS (____Sum of Squares)

The basic relationship between them is: TSS = ESS + RSS They are defined The basic relationship between them is: TSS = ESS + RSS They are defined as follows: n – 2 TSS = S (Yi – Y ) i=1 ESS = n ^ S (Y i – Y i )2 i=1 n ^ – 2 RSS = S (Yi – Y ) i=1 Essentially, the ____is the amount of variation that can’t be explained by the______. The ____quantity is effectively the amount of the ____, total variation (TSS) that could be removed using the regression line.

R 2 is defined as: R 2 RSS = TSS If the regression line R 2 is defined as: R 2 RSS = TSS If the regression line fits____, then ESS = 0 and RSS = TSS, resulting in R 2 = 1. In this example, R 2 =. 694 which means that approximately 70% of the variation in the Y values is explained by the one ______ variable (X), cars per hour.

Now, returning to the original question: Should we build a station at Buffalo Grove Now, returning to the original question: Should we build a station at Buffalo Grove where traffic is 183 cars/hour? The best guess at what the corresponding _____ volume would be is found by placing this X value into the new ______equation: ^ y = a + b * x Sales/hour = 57. 104 + 0. 92997 * (183 cars/hour) = $227. 29 However, it would be nice to be able to state a _____confidence interval around this best guess.

We can get the information to do this from Excel’s Summary Output. Excel reports We can get the information to do this from Excel’s Summary Output. Excel reports that the _______(Se) is 44. 18. This quantity represents the amount of ______in the actual data around the regression line. The formula for Se is: n Se = ^ S (Y i – Y i )2 i=1 n – k -1 Where n is the number of data points (e. g. , 5) and k is the number of ______variables (e. g. , 1).

This equation is _____to: ESS n – k -1 Once we know Se and This equation is _____to: ESS n – k -1 Once we know Se and based on the ______ distribution, we can state that • We have 68% confidence that the _____ value of sales/hour is within + 1 Se of the predicted value ($277. 29). • We have 95% confidence that the actual value of _____/hour is within + 2 Se of the predicted value ($277. 29). The 95% ______interval is: [277. 29 – 2(44. 18); 227. 29 + 2(44. 18)] [$138. 93; $315. 65]

Another value of interest in the Summary report is the ______for the X variable Another value of interest in the Summary report is the ______for the X variable and its associated values. The t-statistic is 2. 61 and the ____is 0. 0798. A P-value less than 0. 05 represents that we have at least 95% confidence that the ____parameter (b) is statistically significantly than 0 (zero). A slope of __results in a flat trend ______and indicates no relationship between Y and X. The 95% confidence limit for b is [-0. 205; 2. 064] Thus, we can’t _____the possibility that the true value of b might be 0.

Also given in the Summary report is the _______. Since there is only ____ Also given in the Summary report is the _______. Since there is only ____ independent variable, the F –significance is identical to the P-value for the t-statistic. In the case of more than one X variable, the F – significance tests the ______that all the X variable parameters as a group are statistically significantly different than zero.

Concerning multiple regression____, as you add other X variables, the R 2 statistic will Concerning multiple regression____, as you add other X variables, the R 2 statistic will always _______, meaning the RSS has increased. In this case, the _____ R 2 statistic is a reliable _____of the true goodness of fit because it compensates for the reduction in the ____due to the addition of more independent variables. Thus, it may report a _____adjusted R 2 value even though R 2 has increased, unless the improvement in ____is more than compensated for by the _____of the new independent variables.

Fitting a Quadratic Function The method of least ____can be used with any number Fitting a Quadratic Function The method of least ____can be used with any number of independent variables and with any _____ form (not just linear). Suppose that we wish to fit a _____function of the form y = a 0 + a 1 x + a 2 x 2 to the previous data with the method of least squares. The goal is to select a 0 , a 1 , and a 2 in order to _____the sum of squared deviations, which is now 5 S (yi – [a 0 + a 1 xi + a 2 xi 2 ])2 i=1

Proceed by setting the partial ______with respect to a 0 , a 1 , Proceed by setting the partial ______with respect to a 0 , a 1 , and a 2 equal to______. This gives the equations 5 a 0 + (Sxi)a 1 + (Sxi 2 )a 2 = Syi (Sxi)a 0 + (Sxi 2)a 1 + (Sxi 3)a 2 = Sxiyi (Sxi 2)a 0 + (Sxi 3)a 1 + (Sxi 4)a 2 = Sxi 2 yi This is a simple set of three linear equations in three_____. Thus, the general name for this least squares curve fitting is “__________. ” The term _____comes from the fact that simultaneous linear equations are being solved.

Solver will be used to find the coefficients in Excel. Consider the following worksheet: Solver will be used to find the coefficients in Excel. Consider the following worksheet:

Now, to find the ______values for the parameters (a 0 , a 1 , Now, to find the ______values for the parameters (a 0 , a 1 , and a 2) using____, first click on Tools – Solver.

In the resulting Solver Parameter dialog, specify the following settings: Click Solve to solve In the resulting Solver Parameter dialog, specify the following settings: Click Solve to solve the______, nonlinear optimization model. In this model, the objective function is to minimize the sum of________.

Here are the Solver results. The parameter values are: This formula calculates the sum Here are the Solver results. The parameter values are: This formula calculates the sum of squared errors directly.

Use Excel’s Chart Wizard to plot the _______data and the resulting ______function. First, highlight Use Excel’s Chart Wizard to plot the _______data and the resulting ______function. First, highlight the original range of data, then click on the _______button.

Use Excel’s Chart Wizard to plot the original data as a _____and specify a Use Excel’s Chart Wizard to plot the original data as a _____and specify a quadratic function via the Chart – Add Trendline … option.

Comparing the Linear and Quadratic Fits In the method of least squares, the _____of Comparing the Linear and Quadratic Fits In the method of least squares, the _____of the squared deviations was selected as the measure of “_______. ” Thus, the linear and quadratic fits can be compared with this______. In order to make this comparison, go back to the linear regression “____” spreadsheet and make the corresponding calculation in the original “______” spreadsheet.

Note that the sum of the squared deviations for the ____function is indeed smaller Note that the sum of the squared deviations for the ____function is indeed smaller than that for the ______function (i. e. , 4954 < 5854. 7). Indeed, the quadratic gives roughly a 15% _____in the sum of squared deviations. A linear function is a special type of ____ function in which a 2 = 0. It follows then: the best quadratic function must be _______as good as the best linear function.

WHICH CURVE TO FIT? If a quadratic function is at least as good as WHICH CURVE TO FIT? If a quadratic function is at least as good as a linear function, why not choose a more ____ form, thereby getting an even better_____? In practice, _______of the form (with only a single independent variable for illustrative purposes) are often suggested: y = a 0 + a 1 x + a 2 x 2 + … + a nx n Such a function is called a _____of degree n, and it represents a broad and flexible class of functions. n=2 quadratic n=3 cubic n=4 _______ …

One must proceed with _____when fitting data with a ______function. For example, it is One must proceed with _____when fitting data with a ______function. For example, it is possible to find a (k – 1)-degree polynomial that will _____fit k data points. To be more specific, suppose we have seven _____observations, denoted (xi , yi), i = 1, 2, …, 7 It is possible to find a ______polynomial y = a 0 + a 1 x + a 2 x 2 + … + a 6 x 6 that exactly passes through each of these seven data points.

A perfect fit gives ______for the sum of squared deviations. However, this is ____, A perfect fit gives ______for the sum of squared deviations. However, this is ____, for it does not imply much about the _____ value of the model for use in future forecasting.

Despite the ____of the polynomial function, the forecast is very_______. The linear fit might Despite the ____of the polynomial function, the forecast is very_______. The linear fit might provide more _____forecasts. Also, note that the polynomial fit has _____ extrapolation properties (i. e. , the polynomial “_____” at its extremes).

One way of finding which fit is truly “better” is to use a different One way of finding which fit is truly “better” is to use a different standard of_______, the “mean squared error” or MSE. sum of squared errors MSE = (# of points – # of parameters) For the______, the number of parameters estimated is 2 (a, b) 5854 MSE = = 1951. 3 (5 -2) For the quadratic fit 4954 MSE = = 2477. 0 (5 -3)

So, the MSE gets ______in this case even though the total sum of squares So, the MSE gets ______in this case even though the total sum of squares will always be less or the same for a ______fit. When there is a_____, both the total sum of squares and the MSE will be_____. Because of this, most forecasting programs will fit only up through a _____polynomial, since higher degrees don’t reflect the general trend of ______data.

What is a Good Fit? A good historical fit may have poor _______power. So What is a Good Fit? A good historical fit may have poor _______power. So what is a good fit? It depends on whether one has some idea about the _____real-world process that relates the y’s and x’s. To be an _____forecasting device, the forecasting function must to some extent capture important ____of that process. The more one knows, the _______one can do. However, knowledge of the underlying process is typically phrased in_____ language. For example, linear curve fitting, in the statistical context, is called_______.

If the statistical _______about the linear regression model are precisely satisfied (e. g. , If the statistical _______about the linear regression model are precisely satisfied (e. g. , errors are _____distributed around the regression line), then in a precise and welldefined sense, statisticians can prove that the linear fit is the “______possible fit. ” In the real world one can never be completely certain about the ______process. The question then becomes: How much ______can we have that the underlying process is one that satisfies a particular set of statistical______? Fortunately, statistical analysis can reveal how well the _____data do indeed satisfy those assumptions.

And if it does not satisfy the assumptions, then try a different____. Remember, there And if it does not satisfy the assumptions, then try a different____. Remember, there is an underlying real-world _____and the model is a selective __________of that problem. How good is that model? Ideally, to test the goodness of a model, one would like to have considerable ______with its use. If, in repeated use, it is observed that the model performs well, then our confidence is____. However, what confidence can we have at the outset, without experience?

Validating Models One_____, is to ask the question: Suppose the model had been used Validating Models One_____, is to ask the question: Suppose the model had been used to make past decisions; how well would the firm have fared? This approach “creates” experience by ____ the past. This is often referred to as _____of the model. Typically, one uses only a ______of the historical data to create the model – for example, to fit a polynomial of a specified degree. One can then use the remaining _____to see how well the model would have performed.

End of Part 1 Please continue to Part 2 End of Part 1 Please continue to Part 2