Database architecture: When records having different number of attributes (columns)

Question

Suppose I have these records ID 1: has attributes A,B,D ID 2: has attributes B,C ID 3: has attributes F ID 4: has attributes C,G .....(Attributes will not duplicate in the same ...

Accepted Answer

Actually, none of your designs are optimal (the third is the best), and I recommend a single junction table which relates ID valued to their attributes, e.g.ID | attr1  | A1  | B1  | D2  | B2  | C3  | F4  | C4  | GThis is the most normalized approach.  To see why this design is optimal, see how easy it is to find all IDs which have attribute B:SELECT DISTINCT IDFROM yourTableWHERE attr = 'B';It is also fairly straightforward to find all IDs having both attributes B and D:SELECT IDFROM yourTableWHERE attr IN ('B', 'D')GROUP BY IDHAVING MIN(attr) <> MAX(attr);Your first two suggestions would make it much harder to write these queries (give it a try), and in general it is bad practice to store CSV in database tables.  Your third suggestion does store the relationships correctly, but it unnecessarily spreads out data across multiple tables.A more general form of the above query which can easily be extended to any number of IDs is:SELECT IDFROM yourTableWHERE attr IN ('B', 'D')GROUP BY IDHAVING COUNT(DISTINCT attr) = 2;

Advertisement

Answer