What is the optimal relational database design for storing an unknown number of similar but unique entities

Question

The database we are designing allows users to authenticate with multiple 3rd party services, mostly social media (twitter, facebook, etc). There will be an unknown and growing number of these services....

Accepted Answer

A) The most direct solution &#8230; JSONYou are right, option A is grossly incorrect.  It breaks Codds&#8217; First Normal Form, thus it is not Relational.  NULL in the database is an indication of incomplete Normalisation, which leads to complex SQL code.  To be avoided at all costs.similar but uniqueTo be clear, that they are unique to the Service is true.  That {LoginName; UserName; Email; UserId; etc} are all similar is true in the implementation sense only, not in the data.I may need to sketch this out.That is a great idea.  A visual data model is far more effective, because (a) the mind can comprehend it much better than text, and (b) therefore work out details; contradictions; missing bits; etc.  Much easier to progress each iteration visually, than with text.Second, we have had visual modelling tools since 1987 (1984 for a closed group), which have been made a Standard in 1993.  Hopefully you appreciate that a standard-compliant model is better than a home-grown or corporate-supplied one.  It displays all technical details rather than a small subset.Is there a name for this strategyIt is plain old Relational Data Modelling, which includes Normalisation (ensuring compliance with Codd&#8217;s Normal Forms, as opposed to the insanity of implementing the NFs is fragmented progressive steps).ObstacleOne problem that needs to be understood and eliminated is this.  The &#8220;theoreticians&#8221; market and propagate 1960&#8217;s Record Filing Systems under the banner of &#8220;relational&#8221;.  That is characterised by a Record IDs in every file.  That method ensures the database remains physical, not logical, the very thing that Codd overcame with his Relational Model: a database that is logical and therefore extremely easy to navigate, by any querying party, current; planned; or unplanned.  The essential difference between 1960&#8217;s RFS and post-1970 Relational Databases is:whereas the RFS maintains references between Files by physical pointer (Record ID), the Relational Database maintains references between Tables by logical Key.A logical Key is &#8220;made up from the data&#8221; as per Codd(A datum that is fabricated by the system is not &#8220;made up from the data&#8221;)(Use of the SQL command PRIMARY KEY does not magically anoint the datum with the properties and qualities of a Relational Key: if you use PRIMARY KEY RecordID you are in 1960&#8217;s physical paradigm, not the post-1970 Relational paradigm)Logical Keys provide Relational Integrity (as distinct from Referential Integrity, which is an ordinary function of SQL), which is far superior to that obtained by 1960&#8217;s RFSAs well as far superior Speed and Power (far less JOINs, and smaller sets)Relational DatabaseTherefore I will give you the answer as a Relational Data Model, as per Codd.Just one example of Relational Integrity:the ServiceProperty FK elements in UserServiceProperty is constrained to PK (particular combination) in ServicePropertya UserServiceProperty row with Facebook.Email is preventedA Record ID based 1960&#8217;s RFS that the &#8220;theoreticians&#8221; promote as &#8220;relational&#8221; cannot do that, various errors such as that one are allowed.All my data models are rendered in IDEF1X, the Standard for modelling Relational databases since 1993My IDEF1X Introduction is essential reading for beginners.The IDEF1X Anatomy is a refresher for those who have lapsed.If you have trouble reading the Predicates directly from the Data Model, let me know and I will produce them in text form.Please feel free to ask questions, the more specific the better.

Advertisement

Answer

Obstacle

Relational Database