I am attaching a file with an example of a spreadsheet that I am trying to sort out. In this example I have 3 samples (I could have many more). Each sample has 8 columns associated with it (N, M, I, F, S, MS, KM and KD). The length of the dataset is different for each sample. The MS column is the same as M but contains a few zeroes. What I am trying to do is:
1) generate one column (MSA) containing only unique values (no zeroes) from columns MS1, MS2 and MS3. The unique values should be selected within a specified tolerance (for example, 0.001, which makes 52.00706 from MS1 and 52.00701 from MS2 duplicate values although they are not exactly the same)
2) generate 3 columns ( named SS1, SS2 and SS3) with sorted columns S1, S2, and S3 so that for each value of MS in column MSA each of the three columns will list the corresponding value of S1, S2 and S3 (zero if there is no corresponding value)
I have a huge list of accruals and payments. Accruals (positive) are entered, and at a later date are offset by the payments (negative). Im trying to make a schedule so i can determine which are left over.
This is easy to do manually for a small amount of rows. However im dealing with 5000 rows and i do not want to manually match it would take many days to do.
Ive tried a duplicate remover. To get it to work i made an absolute value column for the negatives, and compared it to the positives column to find the duplicates. This works to a point. However, If i have three accruals for 100, and one payment for 100, all are identified as duplicates which obviously is not what im looking for.
I need to get it where one accrual is matched to one payment. if there are 3 accruals, 2 payments, 1 is not a duplicate. if there is 3 accruals, 1 payment, 2 are not duplicates.
I have a list of serial numbers. There are many groups of 8 same serial numbers, where group consists of 2 projects with 4 SN per project (because of 4 different events). I want to create a formula that marks one project with 4 out of 8 SN for deletion based on set of 3 dates assigned to them. In short I need to count 4 rows per project as one unit.
Serial Project Event description Date1 Date2 Date3
Here is a list on what information matters when making decision if to mark project for deletion or not.
1. Project1 has no Dates entered compared to Project2. Mark Pr1
2. Project1 and Project2 has no Dates entered. Mark Pr1 (random, does not matter which should be removed)
3. Project1 has 2009 Dates, Project2 has 2011 Dates. Mark Pr1 because dates are older
4. Project1 has less Date entries filled than Project2 (same year). Mark Pr1 because less Date fields entered.
I can somewhat do it for separate rows, how I can make these rules apply for whole project as one unit related only to one SN at a time. Biggest problem is there is no pattern of dates entered. Sometimes one row can be filled another missing out info and etc.
I am trying to merge rows of duplicate entries, say I have the following information in a excel spreadsheet:
Column A: Name Column B: First Line of Address Column C: Area Column D: County Column E: Post Code Column F: Tel Number
If I have the data above in my spreadsheet I want to basically search and look at columns B,C,D,E and F and then if there are any duplicates of these for that row anywhere on the sheet I want it to merge the Data for those duplicate as shown below:
A|B|C|D|E|F Dave, Mark | Rice Rd | Liverpool | Merseyside | L45 7HT | 6381754 Tony | Hill Grove | Runcorn | Merseyside | L78 9JU | 6527897 John | Lime Ave | Runcorn | Merseyside | L34 9HF | 7248853
I have managed to find a macro that is shown below however this macro only looks at 1 column and then merges the data based on that which is not what I want
i have a wordlist containing about 0.5 Million words in col.A and Millions of meanings in Col.B. Now in Col. A there are many duplicates. I would like to make one cell of all duplicates in col.A and put their meanings together in B with seperator ";". Its very important that there are no duplicates in Col. B.
For example: Col. A Col.B w1 m1;m2;m3 w1 m1;m2;m5 w2 m23;m24 w2 m50
Now after running the macro the list should look like:
Col.A Col.B w1 m1;m2;m3;m5 w2 m23;m24;m50
The words in col. A are alphabetically ordered and all the duplicates are followed by each other.
I have a worksheet containing of column A and B. I want to remove the duplicate words and merge their meanings without having duplicates in merged meanings.
I want to remove the duplicate words in Column B and merge their meanings. The new wordlist should be then in Col. D and the merged meanings in Col.E:
Result should be:
Col. D...........................Col. E w1...............................m1;m2;m4 w2...............................m3
I have a code for this problem but unfotunately it allows duplicates in E. I will be very thankful if some could help me out with that. I want the code not to allow duplicates in column E.
I have data on 400 rows. Each row has a maximum of 10 cells with data, but many have empty cells with no data. I would like to sort each row to show values of cells in sequence and eliminate empty cells. I can use the sort row function but its a long process for 400 individual rows. Is there an easier way?
We receive huge Excel files and we need to validate for and delete duplicate records before they are imported to a software product called Exceed Premier. We are having a difficult time with Excel spreadsheet duplicate record validation because we have to first export the files from the Exceed database into an Excel spread, merge the thousands of other new records from multiple Excel files, then import back into Exceed.
Is there a method in Excel/VBA that can merge several worksheets and check for and delete duplicate irecords? The records will be in the 20-50K range and growing.
I have got a wordlist in worksheet "original" which looks like:
Before: [Code] .....
I need a macro which merges the columns B,C,D,E and F depending on if there are duplicates in Column A or not. If there are one two or more duplicates in column A,then those should be deleted and only one of them should remain in column A. The members of deleted duplicates in column B,C,D,E and F should be merged together. No duplicates should be made by the process of merging. Each member in column B,C,D,E and F has to be unique. The results are supposed to be put in worksheet "new".Columns B,C,E and F should be merged through signe "/". And Column D should be merged through signe ",".
After: [Code] ......
The macro must be able to deal with very large lists. biger than 200 000 words in column A
Here is the excel file containing the example : excel file
I have two name/address lists in separate Excel workbooks. I need to merge and eliminate duplicates by name. One has single name field in format "Lastname, Firstname". The other has a single name field in "normal" format (First and last name). Ultimately I need to end up with two name fields, First and Last, and this I can do.
The problem is in identifying any possible duplicates between the two lists. Why? Because the first list will have names like "Smith, Robert" and occasionally maybe "Smith, Robert J.". The second list will have names like "Dr. Robert J. Smith Jr. PhD", as well as possibly "Robert Smith" or "Dr. Robert Smith".
Using Text-to-Columns will still require a lot of reworking, because the last name won't always be in the same column, depending on whether there's a Dr. or Mr. or middle initial and so forth.
I've looked into the LIKE( ) function which holds some scant hope, but I don't think it will happen for me either.
How I could compare "Robert Smith" to "Robert J. Smith" or "Dr. Robert Smith" or "Dr. Robert J. Smith Jr. Phd"? Imagine all names in column A.
Column A______Column B_____Column C 100/12__________B___________$ 100/12______________________@ 100/12______________________€ 250/13______________________€ 250/13______________________$
I want to keep in ColumnA all three rows of 100/12, because it has a value in Column B in one cell-which is the criteria, and remove the 250/13 because it has no value in cell B.
I was assuming that merging duplicates in column A, and than remove empty from ColumnB.
I am trying to manipulate some worksheets during a Workbook_BeforeClose event. On sheet names that end with "sd" they need to alphabetize the data and autofit the columns. All sheets should hide on close, except "order". This is what I have but warning - it did something bad to my workbook. I can't get it back again. Had to force close Excel.
So I have several columns of data that have a location and then some numbers after it in additional columns. I need to be able to sort it so that all locations that are in both location columns are sorted first, and then any locations that are in one column but not the other column follows. Also the data associated with each column that follows the location needs to remain next to the location. Many times the columns will be of highly disportionate lengths too.
I've attached an HTML table so you can get an idea of what I am looking at, except what I'm dealing with is like.... hundreds of cells long.
And in this 6 row example, I'd actually only be interested in the 4 letters that were in both columns. If it makes it easier, I would be okay with discarding the last rows, but would rather just have them sorted at the end, so I could check if....for example there were two different spellings of the same thing, and they should actually be included.
I need to create a BOTH box that will dynamically pull names from both the "Chicago Office" & "Seattle Office" columns in my sheet. It is extremely preferable to do this without VB Script if possible to avoid the security warnings on opening the sheet. (Our IT department will not budge on this..)
Detail: I have two lists of employees. Column A lists the Chicago Office employees. Column B lists the Seattle Office employees. I've been able to successfully define named ranges to work with these as dynamic lists. I can append names, or delete names, and the Chicago, or Seattle boxes (drop downs created with data validation formulas inside named ranges) will reflect the updated names correctly. I have been unable to make a single drop down with all the names from both offices, that is updated dynamically. I've attached a sheet so that this is easier to understand. Basically I need the "BOTH" drop down to actually work.
The Chicago range is defined as: =OFFSET(Sheet1!$A$2:$A$11,0,0, COUNTA(Sheet1!$A$2:$A$11),1) The Seattle range is defined as: =OFFSET(Sheet1!$B$2:$B$11,0,0,COUNTA(Sheet1!$B$2:$B$11),1)
In the boxes on the right of the lists, I just have a Data Validation List formula as: =Chicago and the other as =Seattle
The code below is supposed to sort and remove duplicates in the sheet in column A. So far it sorts, but I get an error when it tries to remove duplicates. I get an invalid object error.
Code: Sub sort_And_Remove_Dups_Top5_Open_And_Open() Dim lngLr As Long With Sheets("Top 5 Clients")
I have a spreadsheet with 7000 lines exported from a database. I'm looking for lines that exist with an @@2 that don't have a corresponding @@1. Let me explain.
SV10000000@@1 SV10000000@@2 SV10101000@@2
I want to keep the first two lines because there is an @@1 associated with an @@2. I'm looking to single out and delete lines that have @@2 that don't have a corresponding @@1 associated with it.
I am currently using the following to populate the choices in my ComboBox:
For Each rngX In Range("Watchlist_Source_Menu").Cells If Len(rngX.Value) > 0 Then ComboBox2.AddItem rngX.Value End If Next
The range "Watchlist_Source_Menu" is p4:p104. It works fin but if there are duplicates in that range it duplicates the choices in the ComboBox. Is there a way that I can only show unique choices and show them alphabetized in the combo box upon intializing the userform?
I would like to have the duplicate numbers in position one ( cell D9:D18), position two (cell E9:E18) and position three (cell F9:18) all counted and then ranked based upon date range in column A.
The number order of precendence is 0-9 with zero being the lowest.
Column (I9:I18) counts the duplicates, draws and no draws of digits 0-9. Column (J9:J18) converts results from column I9:18 into a percent.
I need to add a sort button at the bottom of position 1 (columns H & I 22), position 2 (columns L & M 22) and position 3 (columns P & Q 22)
When the sort button is selected it will sort the data by # of repeats, draws, and no draws as illustrated in (columns I, M & Q) or sort data by Position number from 0-9 as illustrated in (columns H) which will then show data of repeats and percentages out of rank order, because it is in order of number precedence.
Last but not least a graphic bar chart that shows the results ranked highest to lowest from when the sort button is selected with the number at the top of the bar colored black. Bar graph colors to be: top 3 red, middle 4 yellow and bottom 3 blue. This should show a vertical cascade effect.
Currently use this macro (see below) but occasionally I have more rows than 842. I could increase to 10000 and this would work but looking for a more automated macro that will automatically remove duplicates from all rows i.e. a macro that works out all rows.
Sub copyTab() Cells.Select Selection.Copy Sheets("filtered_data").Select Range("A1").Select ActiveSheet.Paste Columns("A:A").Select Application.CutCopyMode = False ActiveSheet.Range("$A$1:$J$842").RemoveDuplicates Columns:=1, Header:=xlYes End Sub
I have a list of parts in a table. I am manually copy/pasting this list to another sheet in my workbook and then using DATA→Remove Duplicates to get a shortened list. Is there a way to automate this process?
In the attached workbook I want the yellow cells to auto-fill for me.
I get a lot of lists in the format on the left hand side so I want excel to automatically format them for me so that I can use the output on the right.
It’s been working perfectly up until around an hour ago. However, now when I close Excel, I get compile errors.
These compile errors seem to be because Excel is trying to access controls in the workbook after it has closed. Since the workbook is closed, VBA can no longer ‘see’ the controls, and therefore it thows up errors.
I’m also getting a similar error on a Worksheets("DataExplorer").unprotect line, which seems to be because the worksheet isn’t there after closing.
These errors only occur when I close the whole excel application using the big cross in the top right. If I just close the workbook (using the smaller cross just below the big cross in the top right of excel), it doesn’t throw up these errors.
Just to clarify: all of the code runs perfectly when Excel is open. The errors are being thrown up for lines of code which run without problem until Excel is closed.
A1 - Header - "Holidays in XXXX' B1 - Header - "Holidays in YYYY' C1 - Header - "Leaves by YOU"
A2:B11 have static dates consisting of 10 dates in each column. C2:C11 - the user may enter any date at any point of time.
I would like to auto-merge the dates in all the 3 columns (A2:C11) in a single column say D2:D31 and then the system should auto-sort the column based on dates in any one order. So as soon as the user enters a value in say cell C2, all the 10+10+1 dates should get sorted.
Also the constraint here is a user may not apply all 10 leaves at in a year. So many of the cells may have blank values.
Hello, I am trying to update a person's finances for this past year. She sent me a workbook that has two sheets with macros. The first sheet is linked to the 2nd. When a name and number is typed into the first sheet it automatically shows up on the 2nd.
My problem is, no matter what number I change on the first sheet under amount, the Total cell on the 2nd sheet gives me the #Value error. I have not changed anything, have not deleted any formulas but no matter what I do, that value error always shows up. I know very little about excel but I thought I knew enough to be able to update this book for her. Is there a special way I should be inputting the numbers on the first sheet so that the value error will show the running total instead of an error?
One of the multitude of areas where I still need drastic improvement is with efficiency. Specifically, avoiding loops.
Are any of you aware of any good, basic level sites, blogs, posts, etc., about better methods or processes? I'm not particularly experienced with programming (I've been a software TESTER for 13 years and just started some programming in recent months), so the more basic the better.