Noting Duplicates And Identifying Data Within
Feb 9, 2010
I have attached a sample sheet which deals with property sales data, in reference to a two-part question.
1. If two or more rows share the same value in column 'E', each of them needs to be identified with a 'Y' in Column G. In looking through old threads, this seems possible, though I could not find an answer I could 'bend' to work. If this is possible, can the following be included?
2. If two or more rows share the same value in column 'E', list the identifiers (value in Column A) for the others in Column H, separated by commas "," or slashes (preferred) "/".
ie.
"00370600000700 'NICHOLLS JOHN W & CARLA R 11/27/2000 85000 '260647 W Y '00370600000800" and
"00370600000800 'NICHOLLS JOHN W & CARLA R 11/27/2000 85000 '260647 W Y '00370600000700" or
"'00370500000801 'FRAHM FREDERICK/ERIK/KRYSTYNA 06/17/2004 110000 '288904W Y '00370500000802/'00370500000803"
The sample sheet attached includes 26 rows of data with several 'double' duplicates and one 'triple'.
Please note that the 'real' file has slightly fewer than 200,000 rows, and I have seen up to 40 'duplicates' with the same value in Column 'E'.
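The logic of both parts can be sketched outside Excel. The following is a minimal Python sketch, not a worksheet formula: it assumes each row is a list whose first item is the Column A identifier and whose fifth item is the Column E value, and the function name `flag_duplicates` is made up for illustration. It groups identifiers by the Column E value in one pass, which should stay practical at roughly 200,000 rows.

```python
from collections import defaultdict

def flag_duplicates(rows, key_col=4, id_col=0, sep="/"):
    """For each row return (flag, others): 'Y' and the other identifiers
    that share the same key value, or two empty strings if the key is unique."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[key_col]].append(row[id_col])

    result = []
    for row in rows:
        ids = groups[row[key_col]]
        if len(ids) > 1:
            others = [i for i in ids if i != row[id_col]]
            result.append(("Y", sep.join(others)))
        else:
            result.append(("", ""))
    return result

# Sample rows based on the examples in the post (columns A to E only).
rows = [
    ["00370600000700", "NICHOLLS JOHN W & CARLA R", "11/27/2000", 85000, "260647"],
    ["00370600000800", "NICHOLLS JOHN W & CARLA R", "11/27/2000", 85000, "260647"],
    ["00370500000801", "FRAHM FREDERICK/ERIK/KRYSTYNA", "06/17/2004", 110000, "288904"],
]
for flag, others in flag_duplicates(rows):
    print(flag, others)
```

The third sample row has no partner here, so it gets no flag; in the real sheet it would list the other two identifiers of the 'triple'.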
Feb 11, 2014
I am involved in a software conversion that is taking 4 full-time folks over 5 weeks to clean up and assign an alphanumeric sequential number to each vendor and client. Each scrubber is reviewing an Excel spreadsheet containing the names, addresses, FID, telephone, etc. of our vendors and customers. This information is being pulled from 2 separate sources. We are assigning a BP # to the main office location and not retiring that one. Then we go on to identifying the dups. All dups get a Y to be retired, but if they have a different address than the main one, we place a Y to bring that address over under that BP#.
Ultimately, we end up with 2 systems combined into one, dumping all old numbers assigned and giving each vendor, customer, etc. a new BP# that may have multiple addresses.
How can we assign an alphanumeric number without going through each individual line? There are over 900,000 of them to do. The key is to identify duplicate addresses and duplicate names. Some names might be RK Electrical or Robert King Electrical, but the address will usually be duplicated.
Aug 6, 2009
Here is the deal: I have 4 columns. Each line gives you the following information:
The ID is the record number, Code_Name is a code for each fox in the study, Date is the date, and Area is a sub-area in a bigger grid. Basically I have an area divided into squares, and every time a marked fox enters one of my squares a new line is created in the data. What I want to know is whether a fox, when in my study area, will return to the same squares or not.
ID | CODE_NAME | Date | Area | Logical
116 | SSS1 | 02-Jan-09 | 1A1 |
273 | RRR1 | 02-Jan-09 | 2A2 |
2959 | BBB1 | 02-Jan-09 | 1B1 |
2959
What I’m trying to achieve is a 5th column where I’ll get a logical value of TRUE or FALSE showing whether, in its next available record, each fox went back to the same square or not.
So if you check the fox RRR1, I have 2 records: one on 2 January in area 1A1 and a second on 7 January in the same area. For the fox BBB1 you will see that she was always in different areas, and CCC1 only came back to one square.
The problem is I have over 400 foxes and 12,000 records, and I’m trying to find a way of doing it automatically.
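One way to read the requirement is "TRUE when this fox has already been recorded in this square". The Python sketch below follows that reading only; it assumes the records are already sorted by date, and the function name `revisit_flags` and the sample records are hypothetical.

```python
def revisit_flags(records):
    """records: list of (code_name, area) tuples in chronological order.
    Returns one boolean per record: True when the same fox was already
    recorded in the same area earlier in the list."""
    seen = set()
    flags = []
    for code_name, area in records:
        flags.append((code_name, area) in seen)
        seen.add((code_name, area))
    return flags

# Hypothetical records: RRR1 returns to 1A1, BBB1 never repeats a square.
records = [("RRR1", "1A1"), ("BBB1", "1B1"), ("RRR1", "1A1"), ("BBB1", "2A2")]
print(revisit_flags(records))
```

Because it keeps a set of (fox, square) pairs, the check is a single pass over the data, so 12,000 records are not a concern.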
Jul 18, 2012
I have organized account numbers from two systems onto a spreadsheet, with numbers from System 1 arrayed in Column A and numbers from System 2 arrayed in Column B. I need to evaluate the numbers in both columns and isolate the numbers that are NOT DUPLICATES across the two systems (Columns A and B) and return a list of non-duplicate numbers in Column C. Here is what the table would look like:
System 1 | System 2 | Non-Duplicates
173 | 215 | 173
[Code] .......
What is the best approach to use?
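As a rough illustration of the logic (in Python, not as an Excel formula), the non-duplicates are the values that appear in only one of the two columns. The function name `non_duplicates` is made up, and the value 301 in the sample is invented to show a number present in both systems.

```python
def non_duplicates(system1, system2):
    """Return the values found in exactly one of the two lists,
    System 1 values first, each group in its original order."""
    s1, s2 = set(system1), set(system2)
    only_in_1 = [v for v in system1 if v not in s2]
    only_in_2 = [v for v in system2 if v not in s1]
    return only_in_1 + only_in_2

# 173 and 215 come from the post; 301 is a hypothetical shared account number.
print(non_duplicates([173, 301], [215, 301]))
```

A value repeated within a single column would be listed once per occurrence; wrapping the result in a set would collapse those if that is not wanted.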
Nov 30, 2009
I would like to identify duplicates in a list using conditional formatting in Excel 2007.
I have tried choosing to identify duplicates using the formula that I have found on many threads throughout the message board:
=COUNTIF($A$1:$A1,$A1)>1.
This function works up to 15 characters in a cell, but Excel seems to be treating all digits after the first 15 as the same, resulting in a "fuzzy match" where I want an exact match. Many of the values in my list are 18 characters long, in text format to prevent rounding.
I've noticed that Excel treats the 18-character values the same way when sorting; for example, it treats these two values as the same:
'234567891011121314
'234567891011122413
Is there a way to force Excel to examine those last four digits for the purpose of sorting & identifying duplicates?
May 5, 2009
I've collected some data from a GPS logger regarding the speed of an athlete. I want to calculate how many sprints this particular athlete undertook during a training session.
Sprinting is defined as a speed of > 20 kph.
One sprint would be the attainment of one peak >20 kph before decreasing below 20 kph.
It's easy to identify the 3 peaks and thus sprints from the xy scatterplot in the attached file, but I'm struggling to find a way to calculate this.
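Given the definition above, a sprint can be counted each time the speed crosses from 20 kph or below to above 20 kph. The Python sketch below shows that counting rule; the function name `count_sprints` and the speed samples are hypothetical, not taken from the attached file.

```python
def count_sprints(speeds, threshold=20.0):
    """Count how many times the speed rises from at or below the
    threshold to above it (one count per continuous run above it)."""
    sprints = 0
    above = False
    for speed in speeds:
        if speed > threshold and not above:
            sprints += 1
        above = speed > threshold
    return sprints

# Hypothetical GPS speeds in kph containing three separate runs above 20.
speeds = [5, 12, 21, 24, 18, 9, 22, 19, 25, 26, 3]
print(count_sprints(speeds))
```

If the logger data hovers around 20 kph, brief dips below the threshold would split one sprint into several, so some smoothing may be needed first.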
Feb 10, 2009
In column H I have a list of numbers separated by spaces; the number of lines can change. In column L I have a list of numbers which can change, either expanding or contracting.
I would like to check each cell in column H and if any numbers are not listed in column L then it/they should be shown in column G.
Example1 H2 shows 6 11, therefore cell G2 should show 11.
Example 2 H6 shows 5 6 9 11 therefore G6 should show 9 11
Sheet1 (columns I to K are empty)
Row | H | L
1 | Container ID |
2 | 6 11 | 1
3 | 5 8 11 | 2
4 | 5 7 11 | 3
5 | 5 7 | 5
6 | 5 6 9 11 | 6
7 | 5 6 9 |
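The check itself is a split-and-filter: split the column H cell on spaces and keep the numbers that are not in column L. The Python sketch below shows this, assuming column L holds 1, 2, 3, 5 and 6 (which is how the sample appears to read); the function name `missing_numbers` is made up.

```python
def missing_numbers(cell_text, allowed):
    """Return the space-separated numbers from cell_text that do not
    appear in the allowed list, keeping their original order."""
    allowed_set = {str(a) for a in allowed}
    return " ".join(n for n in cell_text.split() if n not in allowed_set)

column_l = [1, 2, 3, 5, 6]  # assumed contents of column L
print(missing_numbers("6 11", column_l))      # Example 1 (H2)
print(missing_numbers("5 6 9 11", column_l))  # Example 2 (H6)
```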
Apr 22, 2009
I am looking for the easiest way to find duplicate Work Order numbers that exist in 2 separate workbooks. For example, Workbook 1, Sheet 1 contains the numbers 1-100 in A1:A100.
Workbook 2, Sheet 1 contains some of the numbers between 1 and 100, located somewhere in A:A. For argument's sake, let's assume those numbers are 3, 6, 33, 87, 99.
What would the formula be to return the values that are in both of the workbooks?
Mar 11, 2014
I have a small table I would like to create. It can be done manually (but it would be very, very time-consuming), and I'm sure there is a way using IFs and VLOOKUPs so that the data selection can be done automatically.
In column 1 I have various valuations, from 0 to anything over 50 million, that I need to separate into 4 different columns based on their size. So column A would have 0 - 250k, column B 251k to 500k, column C 501k to 1 million, and so on.
vals 0-250 251-500 501-1mill
555,000 300,000
150,00 75,000
Please see attached ... testing values 1.xlsx‎
Dec 6, 2006
I am trying to slim down my database results in Excel via MS Query by searching for Part ID's that are numeric (we have parts that also contain letters....I want to weed those out).
In all my searching on the web, I thought the ISNUMERIC() function should be the function for this, but I keep getting an ORA-00904::"ISNUMERIC":invalid identifier....
Is this function supposed to work, or is there another function that will do this?
This is my SQL statement so far, which works to get parts that are 6 characters long only:
SELECT PART.ID
FROM SYSADM.PART PART
WHERE (LENGTH(PART.ID)=6)
When I change it to this to get parts that are numeric, it gives the error above:
SELECT PART.ID
FROM SYSADM.PART PART
WHERE (LENGTH(PART.ID)=6) AND (ISNUMERIC(PART.ID)=1)
Sep 17, 2009
I am trying to work out how to use the RANK formula to rank numbers in column B and keep them in unison with column A.
So column A has, say, five 1's, with column B having different scores; then, continuing under the 1's in A, come the 2's, and so on. Is there a way to continue the ranking formula without manually changing the cell ranges?
So =rank(B1,$B$1:$B$7,1), but can I do that if A=1, and then A=2, etc.?
so if A=1,rank(B1,$B$1:$B$7,1)
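The underlying idea is a rank computed only against rows that share the same column A value. The Python sketch below illustrates that rule rather than giving an Excel formula: the ascending rank of a score is one plus the number of lower scores in the same group, so ties share a rank. The function name `rank_within_groups` and the sample scores are hypothetical.

```python
def rank_within_groups(rows):
    """rows: list of (group, score) tuples.
    Returns the ascending rank of each score within its own group;
    tied scores receive the same rank."""
    ranks = []
    for group, score in rows:
        lower = sum(1 for g, s in rows if g == group and s < score)
        ranks.append(lower + 1)
    return ranks

# Hypothetical data: three rows in group 1, two rows in group 2.
rows = [(1, 50), (1, 30), (1, 50), (2, 10), (2, 70)]
print(rank_within_groups(rows))
```

This compares every row with every other row, which is fine for small lists; for very long lists, sorting each group once would be faster.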
Sep 21, 2009
I currently have a database that is similar to the example I've pasted below:
Type | Date | Time
A | 10/01/2009 | 10:00
B | 10/01/2009 | 10:00
C | 10/01/2009 | 10:00
D | 10/01/2009 | 12:00
E | 10/01/2009 | 12:00
As you can see the Date and Time are repeated for several rows. This is how the data I receive comes through as A,B and C refer to a single transaction and D & E refer to another transaction.
Ideally what I would like to achieve is:
Type | Date | Time | Transaction
A | 10/01/2009 | 10:00 | 1
B | 10/01/2009 | 10:00 |
C | 10/01/2009 | 10:00 |
D | 10/01/2009 | 12:00 | 2
E | 10/01/2009 | 12:00 |
Here Excel would compare the date and time of each row, look for matching rows above and below it, and then fill in a column next to it indicating that x number of rows are linked to a single transaction, preferably labelling them in some order so I can tell how many transactions there are.
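The numbering rule can be sketched as "start a new transaction whenever the date and time differ from the row above". The Python below is only an illustration of that rule, assuming the rows are in sheet order; the function name `number_transactions` is made up.

```python
def number_transactions(rows):
    """rows: list of (type, date, time) tuples in sheet order.
    Returns one label per row: the running transaction number on the
    first row of each date/time block, and "" on the following rows."""
    labels = []
    previous_key = None
    count = 0
    for _type, date, time in rows:
        key = (date, time)
        if key != previous_key:
            count += 1
            labels.append(count)
        else:
            labels.append("")
        previous_key = key
    return labels

rows = [
    ("A", "10/01/2009", "10:00"),
    ("B", "10/01/2009", "10:00"),
    ("C", "10/01/2009", "10:00"),
    ("D", "10/01/2009", "12:00"),
    ("E", "10/01/2009", "12:00"),
]
print(number_transactions(rows))
```

The last number produced is also the total number of transactions.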
Jun 19, 2014
I am trying to write a formula that shows the best grade for each student and the subject in which that grade was achieved.
In a previous post I was shown how to identify the highest grade:
MAX(IF(A$3:A$34=A7,C$3:C$34))
I hit CTRL, SHIFT & ENTER to activate the formula - Result in column E
However, I also want to identify the subject in which they scored their highest grade in column F
I have two problems
1. I don't know how to write a formula that brings in the subject based on the grade for each student
2. I don't know how to write a formula in case there is a tie (see DAVID L)
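Both problems come down to keeping, per student, the highest grade seen so far together with every subject that reached it. The Python sketch below shows that logic under the assumption that grades are numeric and higher is better; the function name `best_subjects`, the subject names and the grades are hypothetical.

```python
def best_subjects(rows):
    """rows: list of (student, subject, grade) tuples.
    Returns {student: (best_grade, [subjects with that grade])}."""
    best = {}
    for student, subject, grade in rows:
        if student not in best or grade > best[student][0]:
            best[student] = (grade, [subject])   # new best grade
        elif grade == best[student][0]:
            best[student][1].append(subject)     # tie with current best
    return best

# Hypothetical grades; DAVID L has a tie for his best grade.
rows = [
    ("DAVID L", "Maths", 90),
    ("DAVID L", "Art", 90),
    ("DAVID L", "History", 70),
    ("ANNA", "Maths", 80),
]
print(best_subjects(rows))
```

In the tie case the subjects come back as a list, which could be joined with a separator for display in a single cell.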
Aug 24, 2011
In short, I would like a pivot table to only count unique values, but when I click into the pivot I would like to show all instances of that value. For example:
I have a table of data that I am creating a pivot table from. There are fields for Customer ID, Task Name, Age, and Notes. There will be multiple records for a single Customer ID each time it has new notes.
I would like to create a pivot table that has Task Name in the Row Labels, Age in the Column Labels, and count of Customer ID in the Values, so that, for example, I can see how many accounts have been in the Design task for 2 days. However, when I do this it counts each record, but I would like it to count each unique Customer ID. Also, when I click into the pivot, instead of pulling up one line per Customer ID, I would like it to pull up each instance of Customer IDs in that Task Name/Age combination (similar to doing a DISTINCT in SQL).
Jan 7, 2014
I have a list in which a location is mentioned multiple times.
eg:
Delhi
Delhi
Delhi
Punjab
Punjab
Mumbai
Mumbai
I need a data validation list in which each location appears only once.
Mar 23, 2014
I have a list of isometric drawing numbers ending with a [underscore]weld number e.g. 1692-SG-0040-04_05.
Some welds are repaired--in that scenario the amended weld number will be 1692-SG-0040-04_05R1, and even 1692-SG-0040-04_05R2 if repaired for a second time.
On occasion a weld may be cut out entirely and a new weld done. The weld number for that will be 6317-FG-1690-02_06C1.
And here's a wrinkle I've just verified...a cut weld may also be repaired so the weld number will look like 1698-SG-0077-01_04C1R1.
Is there a formula to count these as one weld:
1692-SG-0040-04_05
1692-SG-0040-04_05R1
1692-SG-0040-04_05R2
This as one weld:
6317-FG-1690-02_06
6317-FG-1690-02_06C1
6317-FG-1690-02_06C2
...and this as one weld:
1698-SG-0077-01_04
1698-SG-0077-01_04C1
1698-SG-0077-01_04C1R1
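The counting rule can be expressed as "strip any trailing C or R suffixes, then count the distinct base numbers". The Python sketch below illustrates this with a regular expression; it assumes the suffixes are always an uppercase C or R followed by digits at the very end of the weld number, and the names `base_weld` and `count_welds` are made up.

```python
import re

# One or more trailing groups such as C1, R2 or C1R1.
SUFFIX = re.compile(r"(?:[CR]\d+)+$")

def base_weld(weld_number):
    """Remove trailing cut (C) and repair (R) suffixes."""
    return SUFFIX.sub("", weld_number)

def count_welds(weld_numbers):
    """Count distinct welds, ignoring cut and repair suffixes."""
    return len({base_weld(w) for w in weld_numbers})

welds = [
    "1692-SG-0040-04_05", "1692-SG-0040-04_05R1", "1692-SG-0040-04_05R2",
    "6317-FG-1690-02_06", "6317-FG-1690-02_06C1", "6317-FG-1690-02_06C2",
    "1698-SG-0077-01_04", "1698-SG-0077-01_04C1", "1698-SG-0077-01_04C1R1",
]
print(count_welds(welds))
```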
Jan 24, 2012
I am having trouble creating a function to count duplicates of duplicates.
An example of the data table 1 is:
Product 1 2nd
Product 1 2nd
Product 1 New
Product 1 New
Product 1 Flt
Product 2 2nd
Product 2 New
Product 2 New
Product 2 Flt
Product 2 Flt
Product 3 2nd
Product 3 2nd
Product 3 2nd
Product 3 New
Product 3 Flt
I created a new table (table 2) and made a list of all the Products on table 1 and removed the duplicates. I now have 3 columns with titles New, 2nd and Flt as follows:
New 2nd Flt
Product 1 XX XX XX
Product 2 XX XX XX
Product 3 XX XX XX
I am trying to count the duplicates for each product (XX), but I can't seem to work it out. I've tried the MS help function, but unsure of the actual formula I need to be using.
I am using Windows 7 and MS Office 2010.
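Each XX is a count of rows matching two conditions at once: the product and the type. As an illustration of that logic in Python (not an Excel formula), the rows from table 1 can be counted as (product, type) pairs; the variable names are made up.

```python
from collections import Counter

# Rows of table 1 as (product, type) pairs, copied from the post.
table1 = (
    [("Product 1", "2nd")] * 2 + [("Product 1", "New")] * 2 + [("Product 1", "Flt")]
    + [("Product 2", "2nd")] + [("Product 2", "New")] * 2 + [("Product 2", "Flt")] * 2
    + [("Product 3", "2nd")] * 3 + [("Product 3", "New")] + [("Product 3", "Flt")]
)

counts = Counter(table1)

# Print table 2: one line per product with the New, 2nd and Flt counts.
for product in ("Product 1", "Product 2", "Product 3"):
    print(product, [counts[(product, kind)] for kind in ("New", "2nd", "Flt")])
```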
Apr 3, 2014
I have data in the form of a table, for example a list of duplicate names, each of which has a corresponding date (lease expiry date). The problem is that the duplicate names have varying dates, so the Remove Duplicates function does not work: I need to remove the duplicate names with the older dates and keep only the latest date for each name.
Data Currently:
Sue | 1/3/2014
Jay | 25/4/2013
Jay | 25/4/2013
Mike | 8/8/2014
Mike | 8/8/2014
Sue | 1/3/2014
Sue | 25/6/2012
Sue | 1/3/2014
Sue | 5/7/2012
Jay | 2/2/2011
Mike | 5/5/2010
Solution should be:
Mike | 8/8/2014
Sue | 1/3/2014
Jay | 25/4/2013
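The rule "keep the latest date per name" can be sketched as a single pass that remembers the newest date seen for each name. The Python below illustrates this, assuming the dates are text in day/month/year order (as 25/4/2013 suggests); the function name `latest_per_name` is made up.

```python
from datetime import datetime

def latest_per_name(rows, fmt="%d/%m/%Y"):
    """rows: list of (name, date_text) tuples.
    Returns {name: date_text} holding the latest date for each name."""
    latest = {}
    for name, date_text in rows:
        parsed = datetime.strptime(date_text, fmt)
        if name not in latest or parsed > latest[name][0]:
            latest[name] = (parsed, date_text)
    return {name: text for name, (_parsed, text) in latest.items()}

# The current data from the post.
rows = [
    ("Sue", "1/3/2014"), ("Jay", "25/4/2013"), ("Jay", "25/4/2013"),
    ("Mike", "8/8/2014"), ("Mike", "8/8/2014"), ("Sue", "1/3/2014"),
    ("Sue", "25/6/2012"), ("Sue", "1/3/2014"), ("Sue", "5/7/2012"),
    ("Jay", "2/2/2011"), ("Mike", "5/5/2010"),
]
print(latest_per_name(rows))
```

The same result can usually be reached in a sheet by sorting on date, newest first, before removing duplicates on the name column, provided the tool keeps the first occurrence it meets.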
Jun 10, 2014
I'm trying to find out the rule for de-duplicating data. I am removing duplicates based on an identification number in a data set of about 6000 records, including the duplicates (some records appear about 4 times). Due to the nature of the data I'm working with, there are only a handful of records that are "true" duplicates, i.e. some of the records appear 4 times but there is a difference in terms of location, etc and some are true duplicates in that there is no difference.
I need to know how Excel removes duplicates - does it only keep the first line that it finds for that identification number? Also, is there a way that I could create a rule for it to keep the record with the highest rate for example?
Dec 26, 2012
I would like to use the sumproduct or countif formula for a set of data. I have looked around the forum and have not found what I need so
Column A______Column B
West___________1
West___________2
West___________1
West___________2
South__________ 1
South__________2
South__________1
South__________2
I want to count column B for all "West" (column A) and I don't want duplicates. So it would count two unique characters for West and two unique characters for South. I want "west" and "south" separated.
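What is being asked for is a distinct count of column B within each column A value. A small Python sketch of that logic follows (an illustration, not a SUMPRODUCT or COUNTIF formula); the function name `distinct_counts` is made up.

```python
def distinct_counts(rows):
    """rows: list of (region, value) tuples.
    Returns {region: number of distinct values seen for that region}."""
    seen = {}
    for region, value in rows:
        seen.setdefault(region, set()).add(value)
    return {region: len(values) for region, values in seen.items()}

# The data from the post.
rows = [
    ("West", 1), ("West", 2), ("West", 1), ("West", 2),
    ("South", 1), ("South", 2), ("South", 1), ("South", 2),
]
print(distinct_counts(rows))
```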
Jan 3, 2010
Let's imagine I have a lot of information about 'Domains', such as emails, phones, etc. My lists are long, and it happens very often that I have the same domain (row) many times. There is no way to remember each time whether I have already filled in the information for a specific domain.
That's why I want to know if there is a way for ALL the information on the row for Domain A to be automatically copied to any other row where Domain A appears again.
Conditional formatting (highlight duplicates) does part of the job by telling me where the duplicates are, but I need the other information to be copied.
Jan 1, 1970
This looks close to what I have been trying to find for months, but what if I want to delete duplicates after pasting instead of producing an error code? For instance, I have a list of names and addresses and I want to delete any entire row with a duplicate address.
Mar 12, 2014
I have several fields in a row that contain names of files e.g. 123.xlsx. Some fields will contain file names that will be duplicates of each other and some will be blank entries (although the blank entries can be changed to a value such as 'n/a' or 'no' etc if required).
I require only the non duplicate values to appear in the final cell, each separated with ';'.
My data is in row 2 of a spreadsheet and in every other column (A,C,E,G,I,K,M...for 45 instances in total).
I have used the following formula to identify the unique values (example below for the first four cells):
=A2&IF(C2=A2,"",","&C2)&IF(OR(E2=A2,E2=C2),"",","&E2)&IF(OR(G2=A2,G2=C2,G2=E2),"",","&G2)
This works well and if there are several blank entries then I use a SUBSTITUTE function to change the multiple ',,,,' to a single ';'. So I only see the unique file names in the final cell, separated with ';'.
However, the above formula becomes longer and longer when each cell is added to it. I have over 40 cells that need to be added and I wondered if there was a better way of doing this?
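The growth problem comes from comparing each cell with every earlier cell by hand. The logic itself is a loop that skips blanks and anything already collected, as the Python sketch below shows. This illustrates the approach and is not a replacement formula; the function name `join_unique` and the sample file name abc.xlsx are hypothetical.

```python
def join_unique(cells, sep=";", blanks=("", "n/a", "no")):
    """Join the distinct, non-blank cell values in their original order."""
    kept = []
    for cell in cells:
        value = cell.strip()
        if value.lower() in blanks or value in kept:
            continue  # skip blank markers and values already collected
        kept.append(value)
    return sep.join(kept)

# Hypothetical row values (what would sit in A2, C2, E2, G2, I2).
cells = ["123.xlsx", "", "abc.xlsx", "123.xlsx", "n/a"]
print(join_unique(cells))
```

Because blanks are skipped inside the loop, no clean-up of repeated separators is needed afterwards.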
Mar 7, 2014
What I'm trying to do is lookup a description matching the date on worksheet 1 cell (a2) and go to worksheet 2 and copy all descriptions associated with that date into worksheet 1 in cells b3:b6 (array may vary) without duplicates.
The date and descriptions relate to safety inspections and I am trying to include them in a report. The answer is important to me as well as the benefits to many others.
On Work Sheet 1 Looks Like
[Code] .....
Jun 18, 2014
I have four columns of data (each in column B), one on each of 4 worksheets within the same workbook. The rows in these columns contain text. In some cases the text in a row is identical across columns, and in other cases it is unique. There is a percentage in column C for each text in column B.
What I'm trying to do is as follows:
Create a consolidated list of all four columns in a single column (say column A in a separate sheet), so that each row in this consolidated column has a unique text-based value.
Have the percentage value for each unique text-based entry in column B of the separate sheet. If the text in a column A row is one that appeared more than once in the original four columns, then the average of its percentages should appear in column B.
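The two steps amount to collecting a running total and a count per text value across all sheets, then dividing. A minimal Python sketch of that logic follows; the function name `consolidate` and the sample texts and percentages are hypothetical, and each sheet is assumed to be a list of (text, percentage) pairs.

```python
def consolidate(sheets):
    """sheets: list of sheets, each a list of (text, percentage) tuples.
    Returns {text: average percentage across every sheet it appears in}."""
    totals = {}
    for sheet in sheets:
        for text, pct in sheet:
            total, n = totals.get(text, (0.0, 0))
            totals[text] = (total + pct, n + 1)
    return {text: total / n for text, (total, n) in totals.items()}

# Hypothetical data: "alpha" appears on two sheets, "beta" on one.
sheets = [
    [("alpha", 10.0), ("beta", 20.0)],
    [("alpha", 30.0)],
]
print(consolidate(sheets))
```

A text that appears only once simply keeps its own percentage, since its average is over a single value.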
Sep 11, 2013
I have a giant Excel spreadsheet that I am trying to clean up. I want to make one of the columns (with around 3000 rows, many of them the same) into a drop-down list. However, when I try doing so from the Data Validation tab, I get duplicates, triplicates, etc. of the same item.
Feb 27, 2014
I run a report each week and would like to combine it with my Updated Members workbook and then remove duplicates.
Is there a way to add this new data (ActiveList.xlsx) below the last row of data in my UpdatedMembers.xlsx sheet and remove duplicates? (I would like the new additions to stay at the bottom of my original data in the UpdatedMembers sheet, so that I can add notes to their rows.)
I am still learning my way around Excel, and all of my attempts to do this have been unsuccessful. I am not sure how to pull this off.
May 9, 2014
I am filtering data using a listbox (userform).
I want the listbox to show only names (without duplicates).
Apr 4, 2008
Here is what I'm working with:
http://img90.imageshack.us/my.php?im...uestionas8.jpg
What I want it to do is pull out the entries in column C and put them in column F, but without duplicating any of them. So, for instance, in this case submarket 1 would be cary-morrisville-apex, submarket 2 northwest wake, submarket 3 north wake, etc.
Would this involve a VLOOKUP function but an IF/THEN function as well? I'm not really sure how to go about it.
Feb 9, 2012
I have a 5-column document with about 3000 entries that I need to streamline. The first column is a student's name, then grade. The next 2 columns are for Honour Roll Standing (A or B) and/or an Effort Honour Roll Standing (E). Finally, there is also a "term" column indicating whether they received this award in Term 1, 2 or 3.
An example source file is found here: [URL]...
As you can see, there are many students who have received an award in all 3 terms, however there are also some students that only received standing for 1 or 2 terms.
I need to manipulate the data so that there are no duplicate names and all the data regarding Honour Roll standings, and the term in which each occurred, is laid out in one row per student.
IDEAL COLUMN HEADINGS:
NAME | Grade | Term1 HonourRoll | Term1 Effort | Term2 HonourRoll | Term2 Effort |
etc etc etc