Finding Duplicates Out Of Millions
Oct 22, 2009
I have to scrub files of 20,000 phones numbers against a file of several million phone numbers on the national do not call list.
On sheet one I have all 20,0000 phone number and then on sheet two in 5 columns I have roughly 2 million phone numbers. I need to know if any of the 20,000 phone numbers are in the 2 million on sheet two.
Right now I am simply using a vlookup formula but it is taking a very long time to update all of the fields.
View 9 Replies
ADVERTISEMENT
Jul 22, 2009
I have data that has about 10 duplicate values (UTC Time) in one column and another column with number values (depth ft) that vary. I need to obtain the maximum (highest) value in the depth column and remove the other duplicates to filter out the low values. So for the data example below for UTC 15:56:28 I only want the 5.7 row, for 15:56:29 I want5.3 row and so on. I can attach the sheet. - this is a huuge dataset so manual filtering wont work. Data is from a sonar that gives 10 depth readings per second - I only need one depth tat is the highest value.
IDUTCDepth (FT)
115:56:284.3
215:56:284.4
315:56:285
415:56:285.4
515:56:285.7
615:56:285.5
715:56:285.6..........
View 3 Replies
View Related
Oct 3, 2009
I have a workbook that I want to find if I have any duplicate numbers in a specific area.
The area of cells that I am checking is C3 through AO70
I am checking for numbers between 95 and 800. These are all ID# of individuals and not all the numbers between 95 and 800 are used. IE: 97 through 100 are not used ect.
I have already written a macro that does something else and I can use it to check each number as it comes up. However, once the number comes up I don't know how to use it to check the area.
If I can check all the area at one time to find duplicates it would be easier.
I do not know how to do either way but I can adapt my macro to whatever way is possible.
View 9 Replies
View Related
Feb 7, 2014
I have a workbook of appointments, I need a formula to list duplicate appointments and display them using 2 criteria (date and time).
See attached workbook : duplicate entries.xlsx‎
View 7 Replies
View Related
Jun 25, 2008
Just wondering if anyone has a macro, or formula that would allow me to find out (and possibly highlight), when any value in column A is equal to any value in column B. Im dealing with about 2000 rows so its almost impossible to complete manually.
Sorry the heading is supposed to read need help finding duplicates between 2 COLUMNS
View 14 Replies
View Related
Oct 10, 2008
way to wrap or format anything in this post. I dont think the text I put here is code, but I want to be sure, after receiving a moderators infraction for failing to properly wrap code in a previous post.
Now:
I have a wks in which the first column is a list account #'s and the folowing columns are specifics of transactions or interactions.
Lets say it is a movie rental customer list that lists each rental, and column "A" is the customer number, column "B" declares if it was returned late.
I need to compile a list of "all" rentals by customers who have "EVER" had a "late return" or a "YES" in column "B".
I need to find all account records/rows of accounts that at any time had a "YES" in column "B" even if the some or many of that customers rentals/entries have "NO" in column "B"
so say: ...
View 14 Replies
View Related
Aug 23, 2012
I'm trying to condense my email lists in order to stop people receiving the same email having signed up to several lists. How do I compare 5 different columns to find email addresses which appear in more than one...
View 1 Replies
View Related
Jan 28, 2010
I have a spreadsheet with 7000 lines exported from a database. I'm looking for lines that exist with an @@2 that don't have a corresponding @@1. Let me explain.
SV10000000@@1
SV10000000@@2
SV10101000@@2
I want to keep the first two lines because there is an @@1 associated with an @@2. I'm looking to single out and delete lines that have @@2 that don't have a corresponding @@1 associated with it.
View 9 Replies
View Related
Sep 16, 2008
This should be an easy one, it has slipped my mind.
I have columns A and B, I want to find any names that appear in both column A and column B. I want them to show in a certain color.
View 11 Replies
View Related
Apr 1, 2009
Hello everyone....
I have a project where I need items for different boxes.
i have 20 boxes that need the same amount of items. However, when I came towards the end, i ran out of items.
For example
BOX A IS MISSING
ITEM 1 AND 2
BOX B IS MISSING
ITEM 2 AND 5
BOX C IS MISSING
ITEM 1 AND 5
I have all in a spreadsheet all the items that are missing per box. Here comes the main question....
How do I program my spreadsheet find the items that are missing in each box and summarize in another sheet?
The summary I am looking for is....
ITEM 1 - 5 (MISSING)
ITEM 2 - 9 (MISSING)
and so on...
I started doing the code, but I haven't got too far.
View 4 Replies
View Related
Jul 19, 2012
I have two lists of numbers.
The first list contains site numbers of people who havent responded to me.
The second list is the master list of site numbers along with a column showing the date they responded.
Now, a site number is built like this:
123456/0001
123456/0002
So it is possible for the same 6 digits to appear more than once in the master list.
What I need to do is to compare the first 6 digits in the non responder list against the master list, because some sites, like the example above, may have more than one '0001' tag and so if they have responded to me from site '0002' I dont want to spam their other sites with emails.
I've tried using match and various formulas I've found from google etc, but nothing seems to work!
The goal of this is to get a list of non respondents that have not responded from any of their sites listed in the master list.
Non Responses
Master List
Date of Response
203289/0001
201162/0003
06/06/12
[code]...
View 3 Replies
View Related
Apr 8, 2013
I have a database with ~18000 rows and 29 columns. I would like to filter the data by duplicates in one column, based on total, but keep the remaining data in the row. For example: I have account numbers listed in one column, often duplicates. I can get the total in a pivot table no problem, but need the other data associated with that account. I do not need to see all accounts, only duplicates for accounts listed say greater than 5 times. The data in columns B+ are important.
I have a HUGE vlookup I created to paste in the pivot table data (account numbers and totals) to run a look-up based on those numbers, but I see that running into problems when you run 4k+ look-ups.
I want to see accounts listed only 5+ times, include that total (as in a pivot table) and the remaining 28 columns. I have tried to run this in a pivot table completely, but still too much data to process (plus all the subtotals that I have to keep removing).
View 1 Replies
View Related
Jan 13, 2014
I have over 400 thousand rows in my data-set and I'm trying to find duplicates using a countif or similar formula. Let's say I have the following:
Blue
Yellow
Green
Blue
Blue
Yellow
Pink
I need the formula to increment the count. So for example, the first time Blue appears, it equals 1, the second it appears, it equals 2, third time is 3 and so on.
And, I need it to actually work with 400 thousand rows. I have tried using the following formulas without success as Excel keeps crashing:
=IF((COUNTIF(T:T,$T2)>1),COUNTIF(T:T,$T2),"")=IF(COUNTIF(T:T,T2)>1,2,1)
That second one equals "2" if there's more than 1, which will work for me as well. Better would be if it increments though.
The purpose of having a count is so that I can pivot the data, then filter out all the "1"s and report on it. Therefore I can't use conditional formatting, or simply using Excels Remove Duplicates feature. I need the same data-set for other pivots and charts.
View 8 Replies
View Related
Jul 9, 2013
I have a list of text values in column X. I need to come up with a formula in column Y.
X
Y (RESULTS OF REQUIRED FORMULA)
Comment
1
HAT
1
First HAT in column
2
[Code]....
I can't play about with the natural order of the spreadsheet, so there's no chance I can re-sort the data into column X and (easily) identify the duplicates that way. So, it could be that the duplicated value(s) will appear in any cell within that column.
I need to identify whether the item is a duplicate in the unsorted list. Ideally, the first entry of a set of duplicates will be given 1, then the subsequent duplicates themselves given a 0 (zero). It's to subsequently do some counts on.
I guess that as long as the one of the entries in the duplicates is marked with a 1, while the others are 0 (zero), that's all that's important.
View 3 Replies
View Related
Sep 4, 2008
I have a roster for a large group in excel and would like to have an easy way to highlight if there are duplicate entries in the roster as we are merging multiple smaller lists together.
View 9 Replies
View Related
Apr 16, 2007
I have many rows of data. How can I find duplicate cells in a column?
View 9 Replies
View Related
Mar 3, 2008
i have duplicate cell entries occuring. I have a column of about 8000 entries (Column B) and would like to have a cell at the top of my spreadsheet that displays where the first duplicate resides (Row No. will suffice).
At present i have a conditional format on dupllicates, but is is a big task to scroll down through all the data looking for them.
View 9 Replies
View Related
Feb 13, 2014
I'm going through financial statements and I would like to be able to run full blown analysis on them. Most public financial reports are written so that a lot of zero's do not take up the useful space. As such (and there is a ton of this solution out there I've found), many people want to take $1,500,000,000 and turn it into 1.5M
I however, do not. I want to convert it backwards. So when something says 5.19 I would like to format it in such a way that it will read (and I can run calculations against it) as 5,190,000,000 (there are no alpha characters, there are instances where there will be negative numbers). Then I could copy this to the 17 different financial statements I've already got rock'n rolling in Excel right now.
View 4 Replies
View Related
Apr 15, 2009
Nano Caps are companies with a market value below $50 Million
Micro Caps are companies with a market value between $50 Million and $300 Million
Small Caps are companies with a market value between $300 Million and $2 Billion
Mid Caps are companies with a market value between $2 Billion and $10 Billion
Large Caps are companies with a market value between $10 Billion and $200 Billion
Mega Caps are companies with a market value above $200 Billion.
I have this part so far that someone helped me out with that belonged with this criteria:
Micro Caps are companies with a market value below $300 Million
Small Caps are companies with a market value between $300 Million and $2 Billion
Mid Caps are companies with a market value between $2 Billion and $10 Billion
Large Caps are companies with a market value above $10 Billion
=LOOKUP(P2,{0,300000000,2000000001,10000000001},{"Micro","Small","Mid","Large"})
I came up with this one for the new criteria:
=LOOKUP(P2,{0,50000000,300000000,2000000001,10000000001,200000000001},{“Nano”,"Micro","Small","Mid", "Large",”Mega”})
it keeps giving me an error message..... the thing is the criteria that its based off of( market cap) is in millions but this is the function thats in the cell: =K2*T2/1000000
View 2 Replies
View Related
Jan 25, 2013
I have roughly 3-4 million rows of data that i need to collate for data analytics. I'm using Excel 2010 and saw that the max it will go it just 1 milllion + lines.
View 6 Replies
View Related
Feb 24, 2009
To display 1,100,000 as $1.1M & 110,000 as $110.0K I've used the below Custom Number Formatting.
[>=1000000] $#,##0.0,,"M";[
View 9 Replies
View Related
May 22, 2009
I have a giant sheet that culminates in a graph. It is for various regions. When used in a country with in-control currency, you end up with a nice healthy number that looks good on the graph. When our Zimbabwe office uses it with their crazy inflation and high prices, we get a crazy number that goes into scientific notation...
Anyway, my question:
Given a number like 4,200,000 or 6,500 or 345,123, I want to be able to show as few trailing numbers as possible, and give a nice K or B or D or however you abbreviate Quadrillion (kM? for Zimbabwe) after it.
And, if possible, I'd like to do it with number formats, but I am flexible so long as it doesn't use VBA.
The lookup table would be:
Sheet1 AB1<1,000As is21,000K31,000,000Mil.41,000,000,000Bil.5>1,000,000,000,000Tril. Excel tables to the web >> Excel Jeanie HTML 4
All numbers have 2 significant digits, and I want to keep 2, so if it ends up as 4,200,000, I want the answer to read 4.2 Mil, etc.
I tried doing this with a lookup, and couldn't handle it.
View 9 Replies
View Related
Jan 29, 2014
I am using a spreadsheet as a score sheet for a competition. One of the columns is the student's GPA. After entering all the scores there are duplicate final scores. I need a way to have it look at the final score and then use the GPA so that it will not put a duplicate value in the final column.
al Column N is the Total Column, Column O has the Names that correspond to the Total Column. Currently I am taking this total and putting it into Column Q (High Scores) in high to low order. Column R should have the names that match the scores. But with duplicate scores, it is only putting the first name associated with the score. I would like to use the GPA as a final determining factor for the duplicate scores. The higher GPA would come before a lower GPA. I have tried to put an additional column to bring the GPA over to correspond with the High Scores Column, but could not get it to work.There are actually more names for the competition and the top 10 will be moved to a different sheet and further judged. I have attached a sample with the exact formulas that I am using.
View 1 Replies
View Related
May 7, 2014
What I need to do is sort certain entries in longer list (column A, it is in .csv format and needs to be in it so coordinates and names and ID, all sorted with commas) and I have another list (column C) which is shorter list of certain IDs. I googled and tried and got some results for the basic structure but the fuction seems to fail. It doesn't matter how I get that third list done, but there is only one criteria: since the list in column A is really long and those entries need to keep the .csv formatting, the function should copy that info what is in the matching cells.
Let me try to put it simple: .csv cells from column A that have matching ID from column C should be copied to column B (or N).
the function I'm working with right now is
=IF(ISERROR(FIND($C:$C; A1))=TRUE;"0";A1)
View 13 Replies
View Related
Aug 24, 2011
In short, I would like a pivot table to only count unique values, but when I click into the pivot I would like to show all instances of that value. For example:
I have a table of data that I am creating a pivot table from. There are fields for Customer ID, Task Name, Age, and Notes. There will be multiple records for a single Customer ID each time it has new notes.
I would like to create a pivot table that has Task Name in the Row Labels, Age in the Column Labels, and count of Customer ID in the Values, so that, for example, I can see how many accounts have been in the Design task for 2 days. However, when I do this it counts each record, but I would like it to count each unique Customer ID. Also, when I click into the pivot, instead of pulling up one line per Customer ID, I would like it to pull up each instance of Customer IDs in that Task Name/Age combination (similar to doing a DISTINCT in SQL).
View 3 Replies
View Related
Mar 23, 2014
I have a list of isometric drawing numbers ending with a [underscore]weld number e.g. 1692-SG-0040-04_05.
Some welds are repaired--in that scenario the amended weld number will be 1692-SG-0040-04_05R1, and even 1692-SG-0040-04_05R2 if repaired for a second time.
On occasion a weld may be cut out entirely and a new weld done. The weld number for that will be 6317-FG-1690-02_06C1.
And here's a wrinkle I've just verified...a cut weld may also be repaired so the weld number will look like 1698-SG-0077-01_04C1R1.
Is there a formula to count these as one weld:
1692-SG-0040-04_05
1692-SG-0040-04_05R1
1692-SG-0040-04_05R2
This as one weld:
6317-FG-1690-02_06
6317-FG-1690-02_06C1
6317-FG-1690-02_06C2
...and this as one weld:
1698-SG-0077-01_04
1698-SG-0077-01_04C1
1698-SG-0077-01_04C1R1
View 7 Replies
View Related
Jan 24, 2012
I am having trouble creating a function to count duplicates of duplicates.
An example of the data table 1 is:
Product 1 2nd
Product 1 2nd
Product 1 New
Product 1 New
Product 1 Flt
Product 2 2nd
Product 2 New
Product 2 New
Product 2 Flt
Product 2 Flt
Product 3 2nd
Product 3 2nd
Product 3 2nd
Product 3 New
Product 3 Flt
I created a new table (table 2) and made a list of all the Products on table 1 and removed the duplicates. I now have 3 columns with titles New, 2nd and Flt as follows:
New 2nd Flt
Product 1 XX XX XX
Product 2 XX XX XX
Product 3 XX XX XX
I am trying to count the duplicates for each product (XX), but I can't seem to work it out. I've tried the MS help function, but unsure of the actual formula I need to be using.
I am using Windows 7 and MS Office 2010.
View 2 Replies
View Related
Feb 25, 2014
I have been trying to write two formulas in one cell. I have been able to write them both separately but have been unable to join them both together. What I am trying to do if first search name them how many reoccurring numbers appear. I have provided an example below
a b c d
1 Tom 333
2 Sam 22
3 Sam 22
4 John 5
5 Sam 22
6 Sam 1
7 Tom 3
8 Tom 333
So the answer would be
Tom = 2
Sam = 2
John = 1
View 8 Replies
View Related
Apr 8, 2014
I'm trying to Count duplicate text on sheet 2 and populate the number of times repeated on sheet 1.
On sheet 1 I have A3:A128 and would like the number of times these respective cost centers are repeated in Sheet 2 to populate Column B of Sheet 1.
File attached.
Not sure which formula to use. I tried CountIF but didn't work with 2 sheets.
View 7 Replies
View Related
May 19, 2009
The data is copied daily to load and duplicates are removed and new entries are copied to list sheet, by searching on the concatenation.
The problem i have is that i also want to add to the list, the row lines from the load sheet if the date changes in column G. Not really sure how yo approach this. I have attached sample data.
View 11 Replies
View Related