Data Preparation For Statistical Analysis
Jul 8, 2014
For my thesis I collected data on football players. I have the data in an Excel file, but it is unstructured. Due to the nature of the dataset, I don't want to sort it out by hand. The goal is a dataset that is ready for statistical analysis. I have three categories: Player_Performance, Player_Profile and Transfer_History. I will discuss each in turn.
Player Performance
This Excel file contains each player's performance data. As you can see, the player ID and the player name sit in a row above the performance data for the different competitions. All of this should be on the same row: player ID, name, and then the performance data. How can I achieve this?
Player Profile
The player profile file is the easiest. I want to delete the duplicate information, since some players appear twice in the file. I want to delete every row that doesn't start with a player ID. I also want to remove player IDs that aren't followed by player information.
Transfer History
The transfer history has the same layout as player performance. The first row holds the player ID and name, and the transfers are listed beneath it. I want to put every transfer on a row of its own. Each row should start with the player ID, then the name, and then the transfer, so one player ID may appear on several rows.
I have attached the files as they are, plus three files showing how the result should look.
Player_Performance_SMALL.xlsx
Player_Performance14_EXAMPLE.xlsx
Player_Profile_EXAMPLE.xlsx
Player_Profile_SMALL.xlsx
Transfer_History_EXAMPLE.xlsx
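The reshaping described for the performance and transfer files can be sketched outside Excel as follows. This is only a minimal illustration under assumptions not confirmed by the post: a header row is taken to be any row whose first cell is a purely numeric player ID with the name in the second cell, and every other non-blank row is treated as a detail row. The function name `flatten_player_blocks` is hypothetical.

```python
def flatten_player_blocks(rows):
    """Prefix each detail row with the player ID and name from the header row above it.

    Assumption: a header row has a purely numeric ID in its first cell and
    the player name in its second cell; all other rows are detail rows.
    """
    flat = []
    player_id = player_name = None
    for row in rows:
        # Skip completely blank rows
        if not row or all(cell in ("", None) for cell in row):
            continue
        first = str(row[0]).strip()
        if first.isdigit():
            # Header row: remember the current player, emit nothing yet
            player_id = first
            player_name = row[1] if len(row) > 1 else ""
        elif player_id is not None:
            # Detail row: one output row per competition or transfer
            flat.append([player_id, player_name, *row])
    return flat
```

A player ID with no detail rows beneath it produces no output rows, which also matches the wish to drop IDs that are not followed by any information.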
Jul 13, 2013
Can Excel be used to automate the preparation of the Quad Analysis?
Aug 7, 2009
I need to write a program that will take numbers from various sheets, apply a function to them (specifically a statistical analysis), and print the result to a cell on one sheet. I've tried sending all of the numbers to one cell and running the analysis there, but after so many sheets I get a memory error. I don't need all of the numbers to be saved; I only need the result.
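Since only the result is needed, one way to avoid holding every number at once is a running (streaming) calculation. The sketch below uses Welford's online algorithm for the mean and sample standard deviation; it is an illustration in Python rather than the poster's macro, and the class name `RunningStats` is hypothetical. Which statistic the poster actually needs is not stated.

```python
import math


class RunningStats:
    """Accumulate count, mean and sample standard deviation one value at a time."""

    def __init__(self):
        self.n = 0
        self.mean = 0.0
        self._m2 = 0.0  # running sum of squared deviations from the mean

    def add(self, x):
        # Welford's update: no list of past values is kept
        self.n += 1
        delta = x - self.mean
        self.mean += delta / self.n
        self._m2 += delta * (x - self.mean)

    def stdev(self):
        # Sample standard deviation (n - 1 in the denominator)
        return math.sqrt(self._m2 / (self.n - 1)) if self.n > 1 else 0.0
```

Each sheet's values would be fed through `add` in turn, so memory use stays constant no matter how many sheets are processed.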
Mar 22, 2009
I'm using Excel 2007 on Windows XP and am trying to find the right formula for the following problem.
I have a column of 3-digit numbers and want to determine how many times a given digit occurs in the first position (e.g. the 1 in 123), the second position (the 2 in 123), and the third position (the 3 in 123) across the entire column of figures.
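The counting itself can be sketched as below. This is a Python illustration of the logic, not an Excel formula, and it assumes every entry is a whole number between 0 and 999 that should be read as three digits (so 7 is treated as 007). The function name `digit_position_counts` is hypothetical.

```python
from collections import Counter


def digit_position_counts(numbers):
    """Return three Counters: digit frequencies for positions 1, 2 and 3."""
    counts = [Counter(), Counter(), Counter()]
    for n in numbers:
        digits = f"{int(n):03d}"  # pad to three digits, e.g. 7 -> "007"
        if len(digits) != 3:
            raise ValueError(f"not a 3-digit number: {n!r}")
        for position, digit in enumerate(digits):
            counts[position][digit] += 1
    return counts
```

For example, `digit_position_counts(column)[0]["1"]` gives how often the digit 1 appears in the first position.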
Jun 26, 2014
I was working with an array formula.
I have named ranges:
Graphdaterange has all of my dates
GS1 has the values I want to analyze
Ultimately I want to look at the Mondays of the last, say, 10 weeks, normalize the data by removing the high and low values, and finally compute Sum, Min, Max, STDEV, average and total. An array formula would work for me, but when I try to use it the "And" does not appear to work correctly.
I can settle for just a statistical analysis of Mondays for the last 10 weeks, so if normalizing makes this impossible we can drop that large portion of the formula.
Named ranges
GS1_1 - D14-D(X) as defined by a formula is my data
GraphDateRange - C14-C(X) as defined by a formula is the column containing my dates.[code]...
I am not using a macro due to the nature of the tool. I cannot get iterative processes to run nearly as quickly as an array formula.
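The intended calculation can be sketched outside Excel as follows. This is an illustration of the logic only, not a replacement array formula, and it makes two assumptions: "the last 10 weeks of Mondays" is read as the 10 most recent Mondays present in the data, and "removing high and low values" is read as dropping one lowest and one highest value. The function name `monday_stats` is hypothetical.

```python
import statistics


def monday_stats(dates, values, weeks=10):
    """Summarise the most recent Monday values after trimming one high and one low."""
    # Keep only Monday observations (weekday() == 0), oldest first
    mondays = sorted(
        ((d, v) for d, v in zip(dates, values) if d.weekday() == 0),
        key=lambda pair: pair[0],
    )
    recent = [v for _, v in mondays[-weeks:]]
    if len(recent) < 4:
        raise ValueError("need at least 4 Monday values to trim and compute a stdev")
    # Drop the single lowest and single highest value
    trimmed = sorted(recent)[1:-1]
    return {
        "sum": sum(trimmed),
        "min": min(trimmed),
        "max": max(trimmed),
        "mean": statistics.mean(trimmed),
        "stdev": statistics.stdev(trimmed),
    }
```

Both conditions (is it a Monday, is it among the last 10) are applied one after the other here, which sidesteps combining them in a single logical test.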
Nov 22, 2006
In Excel, is there a place called Data Analysis under Tools? I'm an exchange student in Sweden and the work my teacher gave me is half Swedish and half English. I'm looking for the histogram there.
Nov 23, 2007
I have built a reporting dashboard to track employee efficiency. Within this dashboard I have produced a point-scoring league table. I am looking for a formula that will automatically work out who my top performers are and arrange them in the form of 1st, 2nd, 3rd, etc. I believe the answer lies in a pivot table, perhaps with a macro to populate the data, but I would be grateful for some direction.
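The ranking step itself is small. The sketch below shows one common convention, competition ranking, where tied scores share a rank and the next rank is skipped. It is a Python illustration of the logic rather than a worksheet formula, and the function name `rank_scores` and the sample names are hypothetical.

```python
def rank_scores(scores):
    """Map each name to its rank (1 = highest score); ties share a rank."""
    ordered = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    ranks = {}
    for position, (name, score) in enumerate(ordered, start=1):
        if position > 1 and score == ordered[position - 2][1]:
            # Same score as the previous entry: reuse its rank
            ranks[name] = ranks[ordered[position - 2][0]]
        else:
            ranks[name] = position
    return ranks
```

With scores of 90, 85, 90 and 70, the two 90s both rank 1st, the 85 ranks 3rd and the 70 ranks 4th.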
Jun 5, 2007
In one workbook I have two sheets (sheet1 and sheet2) with the same data format that I need to compare and compile into sheet3 by a primary key column.
example:
sheet 1 & 2 data format:
|Atr1|Key|Atr2|SubAtr1_1|SubKey1|SubAtr1_2|SubAtr2_1|SubKey2|SubAtr2_2|Atr3|
Set1 = SubAtr1_1, SubKey1, SubAtr1_2
Set2 = SubAtr2_1, SubKey2, SubAtr2_2
To be more specific, I can compare this with an electrical wiring table, where:
Key is the wire number
Atr1, Atr2, Atr3 are general attributes of the wire number
Set1 holds the attributes of the first end of the wire, called "from":
SubKey1 is the "from" device, with its sub-attributes SubAtr1_1 & SubAtr1_2
Set2 is the same for the other end of the wire.
This means that Set1 and Set2 have a similar format and type of data.
I found an example macro on your site that compares the two sheets of data, but I ran into a problem: sometimes Set1 and Set2 are reversed on sheet2 and are highlighted as a difference, which I don't want.
Please help me with a macro that would create a rearranged copy of sheet2 respecting the following conditions:
1. if Key from sheet1 cannot be found on sheet2 then copy entire row on sheet3.
2. if Key from sheet2 cannot be found on sheet1 then copy entire row on sheet3.
3. If Key from sheet1 = Key from sheet2
   If SubKey1 from sheet1 = SubKey2 from sheet2
   & SubKey2 from sheet1 = SubKey1 from sheet2
   Then reverse Set1 with Set2 by swapping cells
   and copy entire row (with Set1 and Set2 reversed) to sheet3.
All data on sheet 3 should be sorted by Key.
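The three conditions above can be sketched as follows. This is a Python illustration of the logic rather than the requested macro. Rows are represented as dictionaries keyed by the column names from the post. One case is not specified in the post: a Key found on both sheets whose sets are not reversed. The sketch assumes the sheet2 row is copied unchanged in that case, since the goal is a rearranged copy of sheet2. The names `swap_sets` and `build_sheet3` are hypothetical.

```python
SET1 = ("SubAtr1_1", "SubKey1", "SubAtr1_2")
SET2 = ("SubAtr2_1", "SubKey2", "SubAtr2_2")


def swap_sets(row):
    """Return a copy of the row with the Set1 and Set2 cells exchanged."""
    swapped = dict(row)
    for a, b in zip(SET1, SET2):
        swapped[a], swapped[b] = row[b], row[a]
    return swapped


def build_sheet3(sheet1, sheet2):
    by_key1 = {row["Key"]: row for row in sheet1}
    by_key2 = {row["Key"]: row for row in sheet2}
    out = []
    # Condition 1: Key only on sheet1 -> copy the sheet1 row
    for key, row1 in by_key1.items():
        if key not in by_key2:
            out.append(row1)
    for key, row2 in by_key2.items():
        row1 = by_key1.get(key)
        if row1 is None:
            # Condition 2: Key only on sheet2 -> copy the sheet2 row
            out.append(row2)
        elif (row1["SubKey1"] == row2["SubKey2"]
              and row1["SubKey2"] == row2["SubKey1"]):
            # Condition 3: ends are reversed -> swap Set1 and Set2
            out.append(swap_sets(row2))
        else:
            # Assumption (not in the post): matching, non-reversed rows are copied as-is
            out.append(row2)
    # All data on sheet3 sorted by Key
    return sorted(out, key=lambda row: row["Key"])
```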
May 3, 2007
I want to run a data analysis (Rank and Percentile) on each row in my sheet and put the results on a new sheet
(the results are 4 rows). My problem is that somehow the command to run the analysis won't run.
kStop = .Cells(1, Columns.Count).End(xlToLeft).Column
Sheets.Add
ActiveSheet.Name = "Analisys_Results"
Sheets(" Total time data").Select
Var = 4
Var1 = 8
For ILoop = 1 To kStop
    If (ILoop > 1) Then
        Var = Var + 4
        Var1 = Var + 4
    End If.....................
Sep 29, 2007
I am trying to do some analysis on monthly bank account data. To do this I need to take the bank statement information for the month and put it into a table that shows every day of the month (see attached example).
I am currently doing this manually but I'm sure there must be a smarter way of doing this.
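One way to build such a table is to start from a row for every calendar day and add each statement line to its day. The sketch below is a Python illustration; the attached example is not available here, so the layout (one net total per day, zero for days without transactions) is an assumption, and the function name `daily_totals` is hypothetical.

```python
import calendar
from datetime import date


def daily_totals(transactions, year, month):
    """Return {date: net amount} with an entry for every day of the month."""
    days_in_month = calendar.monthrange(year, month)[1]
    # Start with every day present, so days without transactions show 0.0
    totals = {date(year, month, day): 0.0 for day in range(1, days_in_month + 1)}
    for day, amount in transactions:
        if day in totals:  # ignore lines that fall outside the month
            totals[day] += amount
    return totals
```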
May 8, 2014
Data analysis: I have 7 tables in one workbook, each on a separate sheet. The tables have the same columns but different data because they come from different locations. I would like to create a pivot table that combines the data from all 7 tables. I read a little and learned that relationships are needed between the tables, but I can't create them because I don't have unique data in the tables. The tables usually contain repeating names, locations, and some numerical data.
For example, every table contains the worker's name. I create a pivot table from one of the 7 tables, starting with the worker's name in the row field and some data in the value field. Then I add the workers from another table. I can do that, but when I put them in the row field, the workers from the second table appear under those from the first one, and of course the data are calculated incorrectly. I would like to have all workers in the row field with the proper data, meaning one column with all workers from all 7 tables and then some of their data.
I know I can copy all of them into one big table and then create the pivot table, but this is the solution I want to avoid. I know it will work, but the data can change from time to time and the spreadsheet would be very hard to maintain, because whenever the data changes I would have to copy it manually. Is there any way to create such a pivot table from multiple source tables that share the same column structure?
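Because the tables share one column structure, what is needed is an append (stacking the rows) followed by a single aggregation, not a relationship between tables. The sketch below illustrates that idea in Python; it is not an Excel feature walkthrough. The column names `Worker` and `Hours`, the location names, and the function name `combine_and_total` are all hypothetical.

```python
from collections import defaultdict


def combine_and_total(tables, key="Worker", value="Hours"):
    """Stack rows from all tables and total `value` per `key`.

    tables: {location name: list of row dicts with identical columns}
    """
    totals = defaultdict(float)
    for rows in tables.values():
        for row in rows:
            # The same worker in different tables lands in one bucket
            totals[row[key]] += row[value]
    return dict(totals)
```

Re-running the function after the source tables change rebuilds the totals, so nothing has to be copied by hand.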
Feb 11, 2010
I need to extract for each individual dive:
Onset/start of dive (when the whale has completed its surface interval and begins its descent)
Rate of Descent (meters/second)
Onset/start of bottom phase (the whale has stopped descending and begins the "foraging" phase of the dive; not necessarily the deepest point reached)
Number of lunges (you can see these if you plot the data; they are lunges at depth indicative of feeding behavior)
End of bottom phase (begins to ascend)
Rate of Ascent
End of Dive (surface interval begins)
I've attached a brief sampling of the data in case you're curious.
Over the years I have identified these points through visual inspection of Excel charts. It's very easy, but extremely tedious and not at all efficient. I am interested in expediting this whole process and am more than willing to learn programming (i.e. visual basic or the like)...but only IF it is the best approach for this application.
I have attached two dives worth of data, so we can see what the data looks like. Each data set can contain from 1 to 8 hours worth of data.
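As a starting point for automating the visual inspection, the sketch below segments a depth record into dives and derives most of the listed points. It rests on rules that are assumptions, not taken from the post: a dive is any run of samples deeper than a surface threshold, and the bottom phase is the span between the first and last sample at or beyond a fixed fraction of the dive's maximum depth. Lunge counting is not attempted. The names `find_dives`, `surface_depth` and `bottom_fraction` are hypothetical and the default values are placeholders.

```python
def _summarise(dive, bottom_fraction):
    """Summarise one dive given as a list of (time_s, depth_m) samples."""
    max_depth = max(depth for _, depth in dive)
    bottom = [(t, d) for t, d in dive if d >= bottom_fraction * max_depth]
    start_t, start_d = dive[0]
    end_t, end_d = dive[-1]
    b_start_t, b_start_d = bottom[0]
    b_end_t, b_end_d = bottom[-1]
    return {
        "dive_start": start_t,
        "bottom_start": b_start_t,
        "bottom_end": b_end_t,
        "dive_end": end_t,
        # Rates in meters per second; 0.0 if the phase has no duration
        "descent_rate": (b_start_d - start_d) / (b_start_t - start_t)
        if b_start_t > start_t else 0.0,
        "ascent_rate": (b_end_d - end_d) / (end_t - b_end_t)
        if end_t > b_end_t else 0.0,
    }


def find_dives(samples, surface_depth=5.0, bottom_fraction=0.8):
    """samples: list of (time_s, depth_m) ordered by time."""
    dives, current = [], []
    for t, depth in samples:
        if depth > surface_depth:
            current.append((t, depth))
        elif current:
            # Back near the surface: the current dive is complete
            dives.append(_summarise(current, bottom_fraction))
            current = []
    if current:
        dives.append(_summarise(current, bottom_fraction))
    return dives
```

Both thresholds would need tuning against dives that were already identified by eye before trusting the output.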
Nov 20, 2009
I have student demographic data, and I have to track how each student did on each question. Each question also corresponds to a reading standard, and I need to pull two pieces of info from that: how they scored on the assessment (number of answers correct), and whether they missed a certain standard twice or more (e.g. there are 4 instances of StandardA in the test and they had wrong answers on 2 or more of them). I have the solution to the first part: I entered the result of each question as a 1 for correct and 2 for incorrect, then did a COUNTIF and only counted the 1s. I am stumped on getting an analysis of how each student did on the indicators. Since the indicators can occur more than once, generating a pivot table doesn't work in this instance; it counts the 1s and 2s for StandardA, StandardA2, StandardA3, etc. separately, and not the counts of 1s and 2s for all instances of StandardA.
Example:
What I want:
For student Bob - Number of times he answered incorrectly StandardA, StandardB, StandardC, StandardD, etc.............
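The per-standard count can be sketched as below: map every question to its standard first, then count incorrect answers per standard so that repeated instances of StandardA fall into one bucket. This is a Python illustration of the logic; the question labels and the function names `missed_by_standard` and `flagged_standards` are hypothetical, while the 1 = correct / 2 = incorrect coding comes from the post.

```python
from collections import Counter


def missed_by_standard(answers, question_standard):
    """Count incorrect answers (code 2) per standard for one student.

    answers: {question: 1 or 2}; question_standard: {question: standard name}
    """
    missed = Counter()
    for question, code in answers.items():
        if code == 2:
            missed[question_standard[question]] += 1
    return missed


def flagged_standards(missed, threshold=2):
    """Standards the student missed `threshold` or more times."""
    return sorted(s for s, n in missed.items() if n >= threshold)
```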
Sep 16, 2013
Each month we receive an updated budget which includes info for every month of the budget year. I want the user to be able to select the column (which relates to a month) that they want to use for the analysis. I will use the data in this column only for analysis. How can I do that?
Jul 16, 2009
The sheet is very long and monitors the sales activities of each individual every day, extending vertically down the sheet until the present day.
My first priority was to get the information recorded, but now I want to use it.
How would an expert arrange this data to make analysis easier going forward?
Ultimately I will want to look at individuals, groups of individuals, certain types of activity, etc. within various time periods.
Mar 16, 2014
Below are the weekly closing prices for a stock market index. I'm looking to use conditional formatting to highlight in green "buy" signals and in red "sell" signals.
Buy Signal: If the index rises 4% above any previous low point.
Sell Signal: If the index falls 4% from any previous high point.
Date         Adj Close
3/10/2014    72.16
3/3/2014     73.55
2/24/2014    72.83
2/18/2014    71.92
2/10/2014    71.43
[code].....
In the above example a Buy Signal is generated on 1/13/2014 because the market closes 4% above the low of 68.24 on 12/9/2013.
[URL]
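The two rules can be sketched as a small state machine. This is one interpretation, stated as an assumption: "any previous low point" is read as the lowest close since the last sell signal (or since the start of the data), and "any previous high point" as the highest close since the last buy signal, with signals alternating. It is a Python illustration, not a conditional formatting formula, and the function name `swing_signals` is hypothetical. The input must be in chronological order, which is the reverse of the listing above.

```python
def swing_signals(closes, threshold=0.04):
    """closes: list of (date_label, close) in chronological order."""
    signals = []
    state = None  # last signal emitted: "buy", "sell" or None
    low = high = closes[0][1]
    for label, price in closes[1:]:
        low = min(low, price)
        high = max(high, price)
        if state != "buy" and price >= low * (1 + threshold):
            signals.append((label, "buy"))
            state = "buy"
            high = price  # start tracking a fresh high from the buy point
        elif state != "sell" and price <= high * (1 - threshold):
            signals.append((label, "sell"))
            state = "sell"
            low = price  # start tracking a fresh low from the sell point
    return signals
```

With a low of 68.24, as in the example from the post, the buy condition is first met by a close at or above 68.24 × 1.04 ≈ 70.97.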
Apr 10, 2009
I'm trying to write a macro that will analyze data from one spreadsheet and do a regression. I want the information to be output on the same sheet. I tried to use the record function, but I got an error. It said "Run-time error '1004': ATPVBAEN.XLA could not be found." The code read:
Jul 1, 2009
I am having an issue setting up an analysis page for a report that I have. For each individual, I am trying to find out what they are currently working on, what they have completed, and the average time it took to complete. I have the data, but it is in the following format:
Name     Started   Done
Julie    24-Jun    27-Jun
Tom      24-Jun
Julie    24-Jun    27-Jun
Julie    24-Jun
Ken      24-Jun    27-Jun
I need the data to appear with the names across the top and the figures underneath, like this:
            Julie   Tom
Working     1       1
Complete    2       -
Avg Time    3       -
I am having an issue because the data isn't set up in a column format. The data list is also constantly growing, so I want something that updates itself so that I am not constantly updating the report.
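The summary logic can be sketched as below, using the rows from the post. It is a Python illustration rather than a worksheet solution. A blank Done date is treated as "still working", and the average time is the mean number of days between Started and Done. The year is not given in the post, so 2009 is assumed for the sample dates, and the function name `summarise_tasks` is hypothetical.

```python
from datetime import date


def summarise_tasks(rows):
    """rows: (name, started, done) tuples; done is None while still working."""
    summary = {}
    for name, started, done in rows:
        entry = summary.setdefault(
            name, {"working": 0, "complete": 0, "total_days": 0}
        )
        if done is None:
            entry["working"] += 1
        else:
            entry["complete"] += 1
            entry["total_days"] += (done - started).days
    for entry in summary.values():
        total = entry.pop("total_days")
        # Average only over completed items; None when nothing is complete
        entry["avg_days"] = total / entry["complete"] if entry["complete"] else None
    return summary


# Sample rows from the post (year assumed)
rows = [
    ("Julie", date(2009, 6, 24), date(2009, 6, 27)),
    ("Tom", date(2009, 6, 24), None),
    ("Julie", date(2009, 6, 24), date(2009, 6, 27)),
    ("Julie", date(2009, 6, 24), None),
    ("Ken", date(2009, 6, 24), date(2009, 6, 27)),
]
```

Because the summary is recomputed from the full list each time, new rows are picked up without editing the report.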
Nov 18, 2011
Is it easier to use the Solver Add-in than to create your own data analysis code? We are planning to design a scheduler for around 12 technicians in our contact center (doing phone support).
Oct 20, 2013
I've used the following formula successfully to pull data from multiple sheets and provide the total on an analysis worksheet. Example of worksheets and formula.
=DSUM(INDIRECT("'"&A2&"'!D:D"),$B$1,INDIRECT("'"&A2&"'!D:D"))
Example of one of the individual sheets holding original data:
A
B
C
D
[Code].....
Example of what I want to see using the Transpose formula (Analysis sheet):
=TRANSPOSE(INDIRECT("'"&A2&"'!C:C"),$C$1,INDIRECT("'"&A2&"'!C:C")) (This is my theory, which currently doesn't work. I'd like to be able to pull data across worksheets and place the transposed data into the row of the corresponding merchant.)
A
B
C
D
[Code].....
Oct 4, 2006
I am preparing myself to take the Microsoft Excel 2003 exam.
One of the points mentioned in the preparation list is "Perform data analysis using automated tools". How do I use the automated tools? And, if possible, could I get a file that I can apply these tools to?
Apr 16, 2006
I have a data base for a stock.
It goes like this:
Date Price
Date1 Price1
Date2 Price2
How can I do an analysis of the:
Worst 12-month performance (i.e. the maximum a stock dropped in any rolling 12-month period)
Best 12-month performance
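A rolling-window return can be sketched as follows. It assumes the price list holds one price per month in chronological order, so a 12-month return compares each price with the price 12 entries earlier; with daily data the window would need to be set differently. This is a Python illustration, and the function names `rolling_returns` and `best_and_worst` are hypothetical.

```python
def rolling_returns(prices, window=12):
    """Return over every span of `window` periods, oldest span first."""
    return [
        prices[i + window] / prices[i] - 1
        for i in range(len(prices) - window)
    ]


def best_and_worst(prices, window=12):
    """Return (best, worst) rolling return as fractions, e.g. -0.18 for -18%."""
    returns = rolling_returns(prices, window)
    if not returns:
        raise ValueError("not enough prices for one full window")
    return max(returns), min(returns)
```

This measures start-to-end change over each window; the largest peak-to-trough fall inside a window would be a different calculation.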
Aug 2, 2008
I want to set up an automatic sensitivity analysis such as the one shown in the attached example.doc, using my attached model (inventory model.zip). Going horizontally in the sensitivity analysis, I want to show 2H (3Q and 4Q) Sales Growth (30%, 35%, 40%, 45%, 50%, 55%, 60%), and vertically I want to show 2Q Inventory Growth (50%, 55%, 60%, 65%, 70%, 75%, 80%). I want the cells inside to return the sales/inventory spread (a multiple). Basically it would divide the dollar growth in sales in 2H by the dollar growth in inventory and return a multiple. Is it possible to set this up automatically using the data that I have here?
Jun 22, 2009
I'm trying to analyse data and summarise it by date (greater than 60 days, and between 30 days and 60 days), by department using the first two digits of department codes, and where there are multi-line order numbers.
Dec 8, 2013
We are running Office 2010 as part of Office Professional Plus 2010 (corporate install). We have added both the Analysis ToolPak and Analysis ToolPak - VBA to the add-ins and have the button on the toolbar. However, when we click the button, we don't get the popup window that lets the user pick the analysis tool they want to use. We have tried this on a number of PCs (we all run the same version) and it doesn't work on any of them.
What do we have to do to make this ToolPak work?
Feb 13, 2007
I've attached the worksheet I'm working on so you can have a look. Basically, my problem is that every morning, data is pulled into the sheet in this form. The bosses want to be able to manipulate the data in Excel as easily as possible. For example, some would like to have the sum of HDD over a specific period of time for all weather stations, while others would like to have the daily values for one specific weather station.
To give you an idea of their skill level, I think that pivot tables would be too hard for them to use, and I personally don't think they would be flexible enough. Is there some way I could write VBA code with UserForms, so that they could simply click to pick the weather station they want, the dates needed, and whether they want the HDD to be summed?
Apr 25, 2014
So I have something like this:
Pre-test || Post-test
45,5 % || 27,3% (<-- percentage of failure)
Is it possible to calculate the statistical significance if I have only these two percentages?
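A significance test needs the group sizes as well as the percentages. The sketch below shows a standard pooled two-proportion z-test for independent groups, with the sample sizes as explicit inputs; the sizes used in the example (11 and 1100) are purely hypothetical, since the post does not give them. If the pre-test and post-test were taken by the same people, a paired method would be more appropriate than this one. The function name `two_proportion_z_test` is hypothetical.

```python
import math


def two_proportion_z_test(p1, n1, p2, n2):
    """Pooled two-proportion z-test; returns (z, two-sided p-value).

    p1, p2 are proportions (0.455 for 45.5%); n1, n2 are group sizes.
    """
    pooled = (p1 * n1 + p2 * n2) / (n1 + n2)
    standard_error = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    z = (p1 - p2) / standard_error
    # Two-sided p-value from the standard normal distribution
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Same percentages, hypothetical group sizes
z_small, p_small = two_proportion_z_test(0.455, 11, 0.273, 11)
z_large, p_large = two_proportion_z_test(0.455, 1100, 0.273, 1100)
```

With 11 per group the difference is not significant at the 5% level, while with 1100 per group it clearly is, which is why the two percentages alone cannot answer the question.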
Feb 11, 2008
Columns A, B and C contain the details of each event (race). Column D holds the runners in each event and column E a score for each runner. I need to add 5 extra columns with the total score for each event, the maximum for each event, the average for each event, the median for each event and the standard deviation for each event. I need all cells filled. See the attached spreadsheet. I will have thousands of events.
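The five extra columns can be sketched as a two-pass calculation: first collect the scores per event, then write the event's statistics onto every one of its rows. This is a Python illustration of the logic; each row is simplified to `(event, runner, score)`, where `event` stands for whatever combination of columns A to C identifies a race. The sample standard deviation is assumed, with 0.0 used for single-runner events, and the function name `add_event_stats` is hypothetical.

```python
import statistics
from collections import defaultdict


def add_event_stats(rows):
    """Append (total, max, mean, median, stdev) of the event to every row."""
    scores = defaultdict(list)
    for event, _runner, score in rows:
        scores[event].append(score)
    stats = {
        event: (
            sum(values),
            max(values),
            statistics.mean(values),
            statistics.median(values),
            # Sample standard deviation needs at least two scores
            statistics.stdev(values) if len(values) > 1 else 0.0,
        )
        for event, values in scores.items()
    }
    # Every row of an event receives the same five values
    return [(event, runner, score, *stats[event]) for event, runner, score in rows]
```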
Mar 14, 2013
My company uses financial analysis software to evaluate data on a monthly basis. The software has a report writing package that is similar (but not exactly like) Excel. There are 2 functions of this report writer that I would like to find out if
Assume A1=4 and B1=5 (but the data may vary in the future). In C1, I want a green up arrow if B1>A1. If B1
Jun 27, 2014
When the user chooses a customer from a combo box, I need to show some statistical results in the text boxes, such as the last credit/charge amount, the last date, and the amount the customer still owes us.
When the user enters a new amount in the grey text box, it has to be added to the previous amount.
The expected results for my example are in my sample sheet.