This function requires the bigdata library section to be loaded.
bd.duplicated(data, columns=NULL, name="DUPLICATED", copy=F)
bdFrame
,
data.frame
,
bdVector
or ordinary vector.
NULL
then all columns are used.
TRUE
original columns are retained;
otherwise, only the new column is returned.
bdFrame
or
data.frame
(the former if
data
is a
bdFrame
or
bdVector
).
If
copy=FALSE
this contains
a single column with logical values indicating which rows were unique.
If
copy=TRUE
this contains
the new column in additional to old columns (whether included in
the
columns
argument or not).
This function is called by
duplicated
when the
argument is a
bdVector
or
bdFrame
.
The advantage of calling this function directly is that it allows
you to keep the old rows.
x <- bdFrame(a=c(1:2,1:2), b=c(1:3,2)) bd.duplicated(x) bd.duplicated(x, copy=T)