Calculating Median Absolute Deviation And Modified Z-score?
Jan 31, 2014
I'm working with a relatively small data set, so in order to better identify outliers I'm looking to calculate the modified z-score. However, in order to do that, I must first calculate the median absolute deviation (MAD). how to do this AND show an example of how to calculate it in Excel?
I have a set of % score values (e.g. 88%, 94%, 82%, 67%)
I would like to know if there is a relatively simple formula that can be used to calculate the AVERAGE DEVIATION from 100%, as opposed to the AVERAGE DEVIATION from the mean, which is what the excel function =AVEDEV will return.
E.g. The Average Deviation from the mean for the the above set of values is 8.25%, whereas the Average Deviation from 100% is 17.25%
I can create a complex manual formula which sums absolute values of the variation from 100% of each score, and then divides by the count total of the number of scores, however this is flawed for several reasons, including the fact it is very time consuming when used across a large set of scores, and it is corrupted when a non score exists in a cell (some of the 'scores' may be '-', in which case they should be ignored and not included in the calculation of the average deviation from 100% by treating as a 0% score (and a deviation of 100% from 100%).
I am trying to calculate a Robust mean and a robust standard deviation.For Example how to calculate robust mean and robust standard deviation on "Calcium" on the document here: [URL] .......
I have a list of hotel IDs in one column of my spreadsheet. In the next few columns, I have ratings that customers gave the hotel for cleanliness, location, room, etc.
What I need to do is calculate the median of all the ratings for each specific hotel, in a separate column.
I have a list of ages 10-65 and then different number of participants associated with each age, ie: 10 - 5,071, 11 - 6,069, 12 - 8,465, etc. to age 65. I am try to calculate the median age of all participants.
I have a spreadsheet with Data in Columns A-H. Column B is an ID value that will repeat an unknown amount of times. For each Value in Column B I need to calculate the Median, Mean, and GeoMean for the corresponding range of "G_:H_"
Ex. Column B Column G Column H
2 10 5
2 13 9
[code]....
I need the Median, Mean, and Geo Mean values to paste in Columns N-P for each different Station Index. My code only calculates for the first Station ID
Here is the code for what I have so far
Code: Sub Median() Dim r As Long Dim stndx As String Dim i As Long Dim x As Integer Application.Calculation = xlCalculationManual
I have a list of part numbers which repeat when there are multple prices they were sold at. For each unique part number I need one median price. The list if of about 500 parts but with the various different prices the file is 3700 rows. How can i do this quickly? For each unique part number I need one median price.
We are trying to find the median of a large set of numbers to calculate the median income in 2010. For an example we have 8,379k people with $2500 average income, 9,783k with $7500 average income and so on. How can I calculate the median average income of such a large amount of entries?
I want to see if i can make a formula that works out how a player got a score based on a scoring system. See the attached file for more details. I want this to show that the player scored or didn't score, played a full match or was a substitute, was booked or wasn't booked and was sent off or wasn't sent off.
I have a simple count formula which is suppose to be absolute. =COUNT($L5:$V5) and works till I add a column via a macro. Macro adds a new column at column M then inserts data. My problem is when I check the formula after macro it changes to =COUNT($M5:$W5) I thought using the $ sign means Don't change?
how to get either the absolute row or the absolute column of a cell.
Suppose the active cell is H27. If we use:
ActiveCell.Address we will get $H$27
How can we get either $H or $27 only assigned to a variable let it be MyVar.
Please keep in mind that converting the $H$27 to a string and then making string manipulations is not accepted, unless of course it's the only way to go.
Also it would be nice if I get answers on how to get the relative address column/row portions only, too. Like H or 27.
Another tip is that using .Column returns a number, not a letter.
I'm looking for the proper way of determining the success of a forecast over time.
Forecast ranges from -25% to +25%.
I don't want an average, as it would return a 0% deviation over 2 periods. I don't want a standard deviation as it would be deviation from an eventual mean greater than 0.
I want the result of the above to say that my average deviation is 25 basis points. Meaning I'm either high or low by 25 points or that I'm "usually" off by +/- X%
I have a list of 187 countries and I would like to find a way to do this with an array and not by hand. Attached here is an excel with a sample; I would like to compute the standard deviation of the variable for the period 1980-2008 for each country separately. Is there a way to do this without me having to select by hand the range corresponding to each country in the st dev formula?
I am trying to learn VBA and started reading Wallenbach's book and I thought the best way to get up to speed fast was to work on some real problems.
So this is what I am trying to do.
I have a list of about 1000 stock price returns in the following format
jan 1 2008 .033 jan 2 2008 .01 jan 3 2008 -.002 jan 4 2008 -.0033 jan 5 2008 .04
What I want to do is calculate in VBA a moving 3 day standard deviation of returns. So I want the first calculation to include jan 1 - jan 3 and place the answer in the cell to the right of jan 3 return. Then move down 1 row and calculate the standard deviation using jan 2 - jan4 and place the answer in the cell to the right of jan 4 return...and so on until the last cell.
The spreadsheet below shows the Date (Column A), Percentage (Column B), and Standard Deviation (Column C). What I would like is a macro that finds the standard deviation for all percentages of a given date, and for the output to be placed in the first cell of Column C of that given date. If a cell in Column C is not the first for a given date I would like for it to be blank.
Date Percentage Stan Dev (Output) 8/19/09 26.75% 0.77% 8/19/09 26.79% 8/19/09 28.14% 8/19/09 26.57% 8/19/09 26.40% 8/19/09 25.54% 8/19/09 26.61% 9/16/09 36.54% 3.04% 9/16/09 37.27%
I have an excel spreadsheet with multiple file names (including file path) listed in a column. I have to update other files using these 'underlying' files. So each cell has the file path and file name (i.e. Q:FolderFile Name.xls). I would like to have the adjacent cells indicate the last modified date for each file. This saves me time from looking for the last modified date manually across different folders in my servers to see if the 'underlying' file has been updated, which I would need in order to update my other files.
So this function would be something like "get last modified date based on file path". I will need step by step instructions as I am not very familiar with VBA.
Example:
A B 1 Q:Folder1File name1.xls 2/10/2012 2 X:Folder2File name2.xls 12/1/2011
I am sure this is easy but i cant for the life of me find the code for this:
I am trying to have a MSG Box popup after opening a file and it displays the date the current file was last modified with an "OK" button on it to close the box.
I have a number of excel files in a directory that require me to take action after a file has been saved. What I want to do is create a macro that will look to each file and retrieve its 'modified date', then put that into a spreadsheet. What would be ideal is to create a User Defined Function to look at the last modified date of a file in the cell next to it (I have the files names in a spreadsheet as a hyperlink).
I have a dynamic range that I need to be able to calculate the standard deviation for. The dynamic range sometimes has blank cells and #N/A cells which I do not want included in the calculation. However, there are sometimes cells with the number 0 in it (instead of blank) which I do want to include. I have come up with an array formula, but the standard deviation it calculates is not correct. The formula is:
=STDEV(IF(ISNUMBER(A3:A20000)*ISNUMBER(A3:A20000),1,0)). *This is an array formula calculated by pressing CTRL+SHIFT+ ENTER. The number it calculates is 0.43, however, this is incorrect. If you just calculate the standard deviation of the actual data points in excel (cells A138:A5055 in this case) : STDEV(A138:A5055), it returns 0.17. This is the correct standard deviation.