Noting Duplicates And Identifying Data Within

Feb 9, 2010

I have attached a sample sheet which deals with property sales data, in reference to a two-part question.

1. If the row has two, or more, rows share the same value in column 'E', it needs to be identified with a 'Y' in Column G. In looking through old threads, this seems possible, though I could not find and answer I could 'bend' to work. If this is possible, can the following be included?

2. If two or more rows share the same value in column 'E', list the identifiers (value in Column A) for the others in Column H, separated by commas "," or slashes (preferred) "/".
ie.
"00370600000700 'NICHOLLS JOHN W & CARLA R 11/27/2000 85000 '260647 W Y '00370600000800" and
"00370600000800 'NICHOLLS JOHN W & CARLA R 11/27/2000 85000 '260647 W Y '00370600000700" or
"'00370500000801 'FRAHM FREDERICK/ERIK/KRYSTYNA 06/17/2004 110000 '288904W Y '00370500000802/'00370500000803"

The sample sheet attached includes 26 rows of data with several 'doubles' and one 'triple' 'duplicates'.

Please note that in the 'real' file, it has slightly less than 200,000 rows and I have seen 'dupicates' up to 40 with the same value in Column 'E'

View 14 Replies


ADVERTISEMENT

Data Scrubbing - Identifying Duplicates And Assigning One Number

Feb 11, 2014

I am involved in a software conversion that is taking 4 full time folks over 5 weeks to clean up and assign an alpha-numeric sequential number to each vendor, client. Each scrubber is reviewing an excel spreadsheet containing the names, addresses, FID, telephone, etc. of our vendors and customers. This information is being pulled from 2 separate sources. We are assigning a BP # to the main office location and not retiring that one. then we go on to identifying the dups. All dups get a Y to be retired, but if they have a different address then the main one, we place a Y to bring that address over under that BP#.

Ultimately, we end up with 2 systems combined into one dumping all old numbers assigned and giving each vendors, customers, etc. a new BP# that may have muliptple addresses.

How can we assign a alpha numeric number without going through each individual line...over 900,000 of them to do. Key is to identify duplicate addresses and duplicate names. Some names might be RK Electrical or Robert King Electrical but the address will be duplicated usually.

View 9 Replies View Related

Identifying Duplicates In List

Aug 6, 2009

Here is the deal I have 4 columns. Each line gives you the following information:
The Id is the record number, Code_Name is a code for each Fox in the study, date and area is a sub area in a bigger grid. Basically I have an area divided by squares and every time a marked fox enters in one of my squares a new line in the data is created. What I won’t to know is if a fox when in my area of study will return to the same squares or not.

ID

CODE_NAME

Date

Area

Logical
116

SSS1
02-Jan-09
1A1

273

RRR1
02-Jan-09
2A2

2959

BBB1
02-Jan-09
1B1

2959

What I’m trying to achieve is a 5th column were ill get a logical value of TRUE or FALSE if, for each fox in the next available record a fox went back to the same square or not.
So if you check for the fox RRR1 I have 2 records one in 2 of January in area 1A1 and a second in 7 of January in the same area. For the Fox BBB1 you will see that she was always in different areas and for CCC1 she only came back to one square.

The problem is I have over 400 fox’s and 12000 records and I’m trying to get a way of doing it automatically.

View 9 Replies View Related

Identifying Non-Duplicates In Numeric Array

Jul 18, 2012

I have organized account numbers from two systems onto a spreadsheet, with numbers from System 1 arrayed in Column A and numbers from System 2 arrayed in Column B. I need to evaluate the numbers in both columns and isolate the numbers that are NOT DUPLICATES across the two systems (Columns A and B) and return a list of non-duplicate numbers in Column C. Here is what the table would look like:

System 1
System 2
Non-Duplicates

173
215
173

[Code] .......

What is the best approach to use?

View 4 Replies View Related

Identifying Duplicates In A List Using Conditional Formatting

Nov 30, 2009

I would like to identify duplicates in a list using conditional formatting in Excel 2007.

I have tried choosing to identify duplicates using the formula that I have found on many threads throughout the message board:

=COUNTIF($A$1:$A1,$A1)>1.

This function works up to 15 characters in a cell, but Excel seems to be treating all digits after the first 15 as the same, resulting in a "fuzzy match" where I want an exact match. Many of the values in my list are 18 characters long, in text format to prevent rounding.

I've noticed that Excel treats the 18-characters values the same way when sorting; for example, it treats these two values as the same:

'234567891011121314
'234567891011122413

Is there a way to force Excel to examine those last four digits for the purpose of sorting & identifying duplicates?

View 9 Replies View Related

Identifying Peaks In Data

May 5, 2009

I've collected some data from a GPS logger regarding the speed of an athlete. I want to calculate how many sprints this particular athlete undertook during a training session.

Sprinting is defined as a speed of > 20 kph.

One sprint would be the attainment of one peak >20 kph before decreasing below 20 kph.

It's easy to identify the 3 peaks and thus sprints from the xy scatterplot in the attached file, but I'm struggling to find a way to calculate this.

View 11 Replies View Related

Identifying Incorrect Data.

Feb 10, 2009

In column H I have a list of numbers seperated by a space, the number of lines can change. In column L I have a list of numbers which can change either expand or retract.

I would like to check each cell in column H and if any numbers are not listed in column L then it/they should be shown in column G.

Example1 H2 shows 6 11, therefore cell G2 should show 11.

Example 2 H6 shows 5 6 9 11 therefore G6 should show 9 11

Sheet1  HIJKL1Container   ID26 11   135 8 11   245 7 11   355 7   565 6 9 11   675 6 9     Excel tables to the web >> Excel Jeanie HTML 4

View 9 Replies View Related

Identifying Duplicate Data Between Workbooks

Apr 22, 2009

I am looking for the easiest way to find duplicate Work Order numbers that exist in 2 separate Workbooks. EX. Workbook 1 Sheet one contains the numbers 1-100 in A1:A100
Workbook 2 Sheet one contains X amount of the numbers between 1-100 located.

somewhere in A:A. For arguments sake let's assume those numbers are 3,6,33,87,99.
What would the formula be to return the values that are in both of the workbooks?

View 5 Replies View Related

Identifying Data To Selected Criteria

Mar 11, 2014

I have small table i would like to create. Now, it can be done manually ( but its be very very time consuming) but im sure of a way using IFs and VLOOKUPs so that the data selection can be done automatically...

so in column 1 i have various valuations from 0 to anything 50mil plus that i need to then separate into 4 different columns based on their size. so column A would have 0 - 250k, column B 251k to 500k, column C 501k to 1million and etc etc...

vals 0-250 251-500 501-1mill

555,000 300,000
150,00 75,000

Please see attached ... testing values 1.xlsx‎

View 2 Replies View Related

Identifying Numeric Data With MS Query

Dec 6, 2006

I am trying to slim down my database results in Excel via MS Query by searching for Part ID's that are numeric (we have parts that also contain letters....I want to weed those out).

In all my searching on the web, I thought the ISNUMERIC() function should be the function for this, but I keep getting an ORA-00904::"ISNUMERIC":invalid identifier....

Is this function supposed to work or is there another function that will do this

This is my SQL statement so far, which works to get parts that are 6 characters long only:

SELECT PART.ID
FROM SYSADM.PART PART
WHERE (LENGTH(PART.ID)=6)
When I change it to this to get parts that are numeric, it gives the error above:

SELECT PART.ID
FROM SYSADM.PART PART
WHERE (LENGTH(PART.ID)=6) AND (ISNUMERIC(PART.ID)=1)

View 9 Replies View Related

Identifying Varying Data And Rank Formula

Sep 17, 2009

i am trying to work out how to use the rank formula to rank numbers in column B and keep them in unison with Column A.

So Column A has say 5 1's with column B having different scores then continuing under 1 in A is 2 and so on is there a way to continue the ranking formula without manually changing the cell ranges?

so =rank(B1,$B$1:$B$7,1) but can i do that if A =1 and then A=2 etc ?
so if A=1,rank(B1,$B$1:$B$7,1)

View 7 Replies View Related

Identifying The Same Transaction With Multiple Rows Of Data

Sep 21, 2009

I currently have a database that is similar to the example I've pasted below:

TypeDateTimeA10/01/200910:00B10/01/200910:00C10/01/200910:00D10/01/200912:00E10/01/200912:00

As you can see the Date and Time are repeated for several rows. This is how the data I receive comes through as A,B and C refer to a single transaction and D & E refer to another transaction.

Ideally what I would like to acheive is:

TypeDateTimeTransactionA10/01/200910:001B10/01/200910:00C10/01/200910:00D10/01/200912:002E10/01/200912:00

Where Excel can compare the date and times of each row and look for matching rows above and below it and then fill in a column next to it indicating that x number of rows are linked to a single transaction - preferably labelling them in some order to I can tell how many transactions there are.

View 9 Replies View Related

Identifying Data In One Column Based On Highest Figure Data In Another Column?

Jun 19, 2014

I am trying to write a formula that shows the best grade for each each student and the subject in which
that grade was achieved.

In a previous post i was shown how to identify the highest grade

MAX(IF(A$3:A$34=A7,C$3:C$34))

I hit CTRL, SHIFT & ENTER to activate the formula - Result in column E

However, I also want to identify the subject in which they scored their highest grade in column F

I have two problems

1. I don't know how to write a formula that brings in the subject based on the grade for each student

2. I don't know how to write a formula in case there is a tie (see DAVID L)

View 11 Replies View Related

Pivot Table To Not Count Duplicates - But Show Duplicates When Click Into It

Aug 24, 2011

In short, I would like a pivot table to only count unique values, but when I click into the pivot I would like to show all instances of that value. For example:

I have a table of data that I am creating a pivot table from. There are fields for Customer ID, Task Name, Age, and Notes. There will be multiple records for a single Customer ID each time it has new notes.

I would like to create a pivot table that has Task Name in the Row Labels, Age in the Column Labels, and count of Customer ID in the Values, so that, for example, I can see how many accounts have been in the Design task for 2 days. However, when I do this it counts each record, but I would like it to count each unique Customer ID. Also, when I click into the pivot, instead of pulling up one line per Customer ID, I would like it to pull up each instance of Customer IDs in that Task Name/Age combination (similar to doing a DISTINCT in SQL).

View 3 Replies View Related

Data Validation Without Duplicates

Jan 7, 2014

I have alist in which a location is mentioned multiple time..

eg:

Delhi
Delhi
Delhi
Punjab
Punjab
Mumbai
Mumbai

I need a data validation in which a location should come only once...

View 7 Replies View Related

Count Duplicates And Duplicates With Suffix As One Instance

Mar 23, 2014

I have a list of isometric drawing numbers ending with a [underscore]weld number e.g. 1692-SG-0040-04_05.

Some welds are repaired--in that scenario the amended weld number will be 1692-SG-0040-04_05R1, and even 1692-SG-0040-04_05R2 if repaired for a second time.

On occasion a weld may be cut out entirely and a new weld done. The weld number for that will be 6317-FG-1690-02_06C1.

And here's a wrinkle I've just verified...a cut weld may also be repaired so the weld number will look like 1698-SG-0077-01_04C1R1.

Is there a formula to count these as one weld:
1692-SG-0040-04_05
1692-SG-0040-04_05R1
1692-SG-0040-04_05R2

This as one weld:
6317-FG-1690-02_06
6317-FG-1690-02_06C1
6317-FG-1690-02_06C2

...and this as one weld:
1698-SG-0077-01_04
1698-SG-0077-01_04C1
1698-SG-0077-01_04C1R1

View 7 Replies View Related

Excel 2010 :: How To Count Duplicates Of Duplicates

Jan 24, 2012

I am having trouble creating a function to count duplicates of duplicates.

An example of the data table 1 is:

Product 1 2nd
Product 1 2nd
Product 1 New
Product 1 New
Product 1 Flt
Product 2 2nd
Product 2 New
Product 2 New
Product 2 Flt
Product 2 Flt
Product 3 2nd
Product 3 2nd
Product 3 2nd
Product 3 New
Product 3 Flt

I created a new table (table 2) and made a list of all the Products on table 1 and removed the duplicates. I now have 3 columns with titles New, 2nd and Flt as follows:

New 2nd Flt
Product 1 XX XX XX
Product 2 XX XX XX
Product 3 XX XX XX

I am trying to count the duplicates for each product (XX), but I can't seem to work it out. I've tried the MS help function, but unsure of the actual formula I need to be using.

I am using Windows 7 and MS Office 2010.

View 2 Replies View Related

Removing Duplicates From A List Of Data

Apr 3, 2014

I have data in the form of a table. For example a list of duplicate names, each of these names have a corresponding date (lease expiry date). The problem is that the duplicate names have varying dates. So the remove duplicates function does not work because i need to remove the duplicate names with the older dates. I want the latest dates to remain behind.

Data Currently: Solution should be:
Sue1/3/2014Mike8/8/2014
Jay25/4/2013Sue1/3/2014
Jay25/4/2013Jay25/4/2013
Mike8/8/2014
Mike8/8/2014
Sue1/3/2014
Sue25/6/2012
Sue1/3/2014
Sue5/7/2012
Jay2/2/2011
Mike5/5/2010

View 1 Replies View Related

Rule For Removing Duplicates In Data?

Jun 10, 2014

I'm trying to find out the rule for de-duplicating data. I am removing duplicates based on an identification number in a data set of about 6000 records, including the duplicates (some records appear about 4 times). Due to the nature of the data I'm working with, there are only a handful of records that are "true" duplicates, i.e. some of the records appear 4 times but there is a difference in terms of location, etc and some are true duplicates in that there is no difference.

I need to know how Excel removes duplicates - does it only keep the first line that it finds for that identification number? Also, is there a way that I could create a rule for it to keep the record with the highest rate for example?

View 3 Replies View Related

Countif With 2 Columns Of Data And No Duplicates

Dec 26, 2012

I would like to use the sumproduct or countif formula for a set of data. I have looked around the forum and have not found what I need so

Column A______Column B
West___________1
West___________2
West___________1
West___________2
South__________ 1
South__________2
South__________1
South__________2

I want to count column B for all "West" (column A) and I don't want duplicates. So it would count two unique characters for West and two unique characters for South. I want "west" and "south" separated.

View 3 Replies View Related

Find Duplicates + Copy Data

Jan 3, 2010

Let's imagine I have many informations about 'Domains' such as Emails, phones, etc. My lists are long and it happens very often that I have the same domain (row) many times. No way to remember each time if I already filled the informations for this specific domain before.

THat's why I want to know if there is a way that ALL the informaitons on the Row of : Domain A will automatically be copied to another row where it will find another Domain A

The Conditional formatting (highlight duplicates) do part of the job by telling me where are the duplicates, but i need the other infos to be copied.

View 9 Replies View Related

Preventing Duplicates When Entering Data

Jan 1, 1970

THis looks close to what I have been trying to find for months but what if I want to delete duplicates after pasting instead of making an error code? For instance, I have a list of names and addresses and I want to delete any entire row with a duplicate address.

View 9 Replies View Related

Removing Duplicates From Long Row Of Data

Mar 12, 2014

I have several fields in a row that contain names of files e.g. 123.xlsx. Some fields will contain file names that will be duplicates of each other and some will be blank entries (although the blank entries can be changed to a value such as 'n/a' or 'no' etc if required).

I require only the non duplicate values to appear in the final cell, each separated with ';'.

My data is in row 2 of a spreadsheet and in every other column (A,C,E,G,I,K,M...for 45 instances in total).

I have used the following formula to identify the unique values (example below for the first four cells):
=A2&IF(C2=A2,"",","&C2)&IF(OR(E2=A2,E2=C2),"",","&E2)&IF(OR(G2=A2,G2=C2,G2=E2),"",","&G2)

This works well and if there are several blank entries then I use a SUBSTITUTE function to change the multiple ',,,,' to a single ';'. So I only see the unique file names in the final cell, separated with ';'.

However, the above formula becomes longer and longer when each cell is added to it. I have over 40 cells that need to be added and I wondered if there was a better way of doing this?

View 6 Replies View Related

Search Cell Data And Copy Without Duplicates?

Mar 7, 2014

What I'm trying to do is lookup a description matching the date on worksheet 1 cell (a2) and go to worksheet 2 and copy all descriptions associated with that date into worksheet 1 in cells b3:b6 (array may vary) without duplicates.

The date and descriptions relate to safety inspections and I am trying to include them in a report. The answer is important to me as well as the benefits to many others.

On Work Sheet 1 Looks Like

[Code] .....

View 2 Replies View Related

Consolidate 4 Columns Of Data Which May Include Duplicates?

Jun 18, 2014

I have four columns of data (in column B) across 4 worksheets within the same workbook. The rows in this columns contain text. In some cases the text in various rows of each columns are identical, and in other cases they are unique. There is a percentage in column C for each text in column B.

What I'm trying to do is as follows:

Create a consolidated list of all four columns into a single column (say column A in a separate sheet). Therefore, each row in this consolidated column will have a unique text based value.

Have the percentage value for each unique text based entry in column B of the separate sheet. if the text based contents in the column A row is one of those that had the same entry in the original four columns, then the average of the percentages will appear in column B.

View 1 Replies View Related

Duplicates In Dropdown List From Data Validation?

Sep 11, 2013

I have giant excel spreadsheet that I am trying to clean up. I want to make one of the columns (with around 3000 rows and may of them are the same) into a drop down list. However, when I try doing so from the data validation tab, I get duplicates, triplicates, etc. of the same item.

View 1 Replies View Related

Combine Data From One Sheet To Another - Removing Duplicates

Feb 27, 2014

I run a report each week and would like to combine it with my Updated Members workbook and then remove duplicates.

Is there a way to add this new data (ActiveList.xlsx ) to the last row of data of my (UpdatedMembers.xlsx) Sheet and remove duplicates. (I would like the new additions to stay on the bottom of my original data in the UpdatedMembers sheet, so that I can add notes to their row)

I am still learning my way around excel and all of my attempt to do this have been unsuccessful. how to pull this off.

View 1 Replies View Related

Show Name Without Duplicates Using Filter Data Using ListBox

May 9, 2014

I am filtering data using listbox (userform).

I want listbox show only names(without duplicates).

View 4 Replies View Related

Function To Pull Out Data From One Column To Another Without Duplicates

Apr 4, 2008

Here is what i'm working with:

http://img90.imageshack.us/my.php?im...uestionas8.jpg

what i want it to do is pull out the stuff in column C and put it in column F. But i don't want it to duplicate it at all. So for instance in this case, submarket 1 would be cary-morrisville-apex / submarket 2: northwest wake / submarket 3: north wake etc.

would this involve a VLOOKUP function but an if then function as well? I'm not really sure how to go about it.

View 14 Replies View Related

Removing Duplicates In One Column And Summarizing Data

Feb 9, 2012

I have a 5 column document with about 3000 entries that I need streamlining. THe first column is a student's name, then grade. The next 2 columns are for Honour Roll Standing (A or B) and/or an Effort Honour Roll Standing (E). Finally there is also a "term" column indicating if they received this award is Term 1, 2 or 3.

An example source file is found here: [URL]...

As you can see, there are many students who have received an award in all 3 terms, however there are also some students that only received standing for 1 or 2 terms.

I need to manipulate the data so that there are no duplicate names and all the data regarding Honour Roll standings and which term it occured in are all laid out in one row per student.

IDEAL COLUMN HEADINGS:

NAME | Grade | Term1 HonourRoll | Term1 Effort | Term2 HonourRoll | Term2 Effort |

etc etc etc

View 9 Replies View Related







Copyrights 2005-15 www.BigResource.com, All rights reserved