Finding similarity across columns using cluster variables