Remove Reversed Duplicate Rows from a Data Frame — remove_reversed_dupe

This function removes rows from a data frame when the values in two specified columns repeat those of another row in reverse order. It identifies pairs of rows that hold the same two values with the columns swapped (for example, A, B and B, A) and retains only one row from each pair.

Usage

remove_reversed_dupe_cols(df, col1, col2)

Arguments

df: A data frame containing the data to be processed.
col1: The name of the first column to check for reversed duplicates, given as a character string (e.g. "var_1").
col2: The name of the second column to check for reversed duplicates, given as a character string (e.g. "var_2").

Value

A data frame with reversed duplicate rows removed.

Details

Removing reversed duplicates is useful when the order of the values in the two columns carries no meaning, such as in edge lists for undirected networks or in paired observations.
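
The idea can be illustrated with a minimal base R sketch. It is not the package's actual implementation: the helper name remove_reversed_dupe_sketch is hypothetical, and the approach (build an order-independent key per row, then keep the first row for each key) is only one way to get this behaviour. It assumes neither column contains NA, and it also drops exact duplicates of a pair, which may or may not match what remove_reversed_dupe_cols does.

```r
# Hypothetical sketch, not the package's implementation.
# Assumes col1 and col2 contain no NA values.
remove_reversed_dupe_sketch <- function(df, col1, col2) {
  a <- as.character(df[[col1]])
  b <- as.character(df[[col2]])

  # Order each pair so that (A, B) and (B, A) produce the same key
  lo <- ifelse(a <= b, a, b)
  hi <- ifelse(a <= b, b, a)

  # Keep only the first row seen for each unordered pair
  keep <- !duplicated(data.frame(lo = lo, hi = hi))
  df[keep, , drop = FALSE]
}

df <- data.frame(
  var_1 = paste0("street_", c(1, 2, 3, 4, 2)),
  var_2 = paste0("street_", c(2, 1, 4, 3, 1))
)

# Rows 2, 4 and 5 repeat the pairs in rows 1 and 3, so rows 1 and 3 remain
remove_reversed_dupe_sketch(df, "var_1", "var_2")
```

Keeping the first occurrence is an arbitrary choice in this sketch; the documentation above states only that one instance of each pair is retained.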

Examples

if (FALSE) { # \dontrun{
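library(magrittr)  # provides the %>% pipe used below

# Create a sample data frame in which rows 2 and 5 reverse row 1,
# and row 4 reverses row 3
df <- data.frame(
  var_1 = c(1, 2, 3, 4, 2) %>% paste0("street_", .),
  var_2 = c(2, 1, 4, 3, 1) %>% paste0("street_", .)
)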

# Remove reversed duplicates
cleaned_df <- remove_reversed_dupe_cols(df, "var_1", "var_2")

} # }