Finding unique number of IDs in multiple groups

Question

I have a dataset that has doctors and the various practices they work in. Each doctor in my dataset works in at least 1 practice but as many as 17 different practices. I would like to know the unique ...

Accepted Answer

A self join within group excluding self-pairings will generate a table of all pairings for each group.  Use that concept as the basis for counting the distinct &#8216;partner&#8217;s for each doctor over all groups.  For true uniqueness, be sure you are using a doctorId distinct to each individual.  Attempting to prevent a &#8216;self-pair&#8217; based on a name is asking for trouble.  (Consider a fictitious group with doctors Dewey, Dewey, Dewey, Dewey and Dewey &#8212; yeah trouble)data have;input doctor $ group $;datalines;A P1E P1C P2B P2E P2A P3D P3E P3E P5A P5;run;proc sql;  * demonstrate the combinatoric effect of who (P) paired with whom (Q) within group;  * do not submit against the big data;  create table works_with_each as  select     P.doctor as P  , Q.doctor as Q  , P.group   from have as P  join have as Q    on P.group = Q.group     & P.doctor ^= Q.doctor  order by    P.doctor, Q.doctor, P.group  ;   * count the distinct pairing, regardless of group;  create table works_with_counts as  select     P.doctor as P  , count(distinct Q.doctor) as unique_work_with_count  from have as P  join have as Q    on P.group = Q.group     & P.doctor ^= Q.doctor  group by    P.doctor  order by    P.doctor  ; EachUnique other in Pair (Works with) Counts

Advertisement

Answer