Welcome to the Invelos forums. Please read the forum rules before posting.

Read access to our public forums is open to everyone. To post messages, a free registration is required.

If you have an Invelos account, sign in to post.

    Invelos Forums->DVD Profiler: Contribution Discussion Page: 1 2 3 4 5 6 7  Previous   Next
Contributing 'common names' with suffixes
Author Message
DVD Profiler Unlimited RegistrantStar Contributormdnitoil
Registered: March 14, 2007
United States Posts: 1,777
Posted:
PM this userView this user's DVD collectionDirect link to this postReply with quote
Quoting hal9g:
Quote:
Where are all the "database gurus" who should understand data normalization?

I've already started to implement this locally.  I'm in my early 40's now and I figure I'll be dead before the online database is ever properly cross-referenced for cast and crew.  I'm just doing my own and normalization is the first thing I've started as it's the easiest.  Of course now I no longer get to contribute cast/crew updates but I'm not going to wait around for everyone else to figure it out.

For what it's worth, I do use "credited as" locally as I do believe in repeating film credits, but I just don't have the time or energy to bother justifying every little comma to the online voters.  If anybody is actually serious about linking their local cast/crew, you'll almost inevitably freeze yourself out of being able to contribute them.  Ironic.
 Last edited: by mdnitoil
DVD Profiler Desktop and Mobile RegistrantStar Contributorm.cellophane
tonight's the night...
Registered: March 13, 2007
Reputation: High Rating
United States Posts: 3,480
Posted:
PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
Quoting mdnitoil:
Quote:
Quoting hal9g:
Quote:
Where are all the "database gurus" who should understand data normalization?

I've already started to implement this locally.  I'm in my early 40's now and I figure I'll be dead before the online database is ever properly cross-referenced for cast and crew.  I'm just doing my own and normalization is the first thing I've started as it's the easiest.  Of course now I no longer get to contribute cast/crew updates but I'm not going to wait around for everyone else to figure it out.

For what it's worth, I do use "credited as" locally as I do believe in repeating film credits, but I just don't have the time or energy to bother justifying every little comma to the online voters.  If anybody is actually serious about linking their local cast/crew, you'll almost inevitably freeze yourself out of being able to contribute them.  Ironic.

My thoughts have gone in this direction this weekend for myself too. I've been trying to update cast and crew for the online db and have been trying to identify and document linked names. I can say after using the Invelos lookup tool that's it's a very hard road. Almost every credit can be found under all variants for a name, so the figures are not reliable.

I have had many circumstances where the most frequent credit in the Invelos database can be shown to be the least frequent after one factors out duplicate titles (each variation of a title counts as one, including each language that it might be in or any stray description that might be included in a title) and known credit errors. (For example, you can subtract the count of Moviie A and all of its name variants from Credit 2 if you can prove that Credit 1 is the 'as credited' credit for Movie A.)

Using the Invelos lookup figures at face value without factoring out duplicates and errors will get you 'no' votes. Trying to factor out the errors in the Invelos database can take up to an hour per name. And then you can get 'no' votes when your contribution notes spill over the available length of notes shown in the evaluation screen. 

On top of this, the Invelos data is heavily (and I mean HEAVILY) corrupted by the choices that IMDb has made for their common names.  Their choice is almost always the name with the most credits in the Invelos lookup.

This means that if you can't prove a credit wrong (if it's not in your collection, for example), you will likely have to use an incorrect "most frequently credited" name today...only to have to correct it later when (if ever???) all of the other regions and localities of your title get changed to 'as credited' from IMDb data.

Why do I want to do this twice?

And I'm not talking about doing this for a few names or for just the ones for suffixes. Pretty much any name you input into the lookup is loaded with incorrect credits, multiplied several times over by language and other title variants.

I struggled yesterday with the name Robert (F.) Boyle, an art director whom I believe is getting a lifetime Oscar at this year's awards. IMDb picked Robert F. Boyle. Invelos is 64-52 in favor of Robert F. Boyle. Yet most of his credits appear to be just Robert Boyle. I'm still not done analyzing the duplicate data to see if we can go with Robert Boyle (I only have 9 of his credits, 8 of which are credited as Robert Boyle). If not, the task will be to change his name twice: once to Robert F. Boyle and a second time back to Robert Boyle.

Twice doesn't sound like it's that bad until you think that it's 550 profile corrections (2 times 225 profiles for the 116 credits). Of course, about half way through changing them to Robert F. Boyle, the numbers will swing toward Robert Boyle and we'll do a U-turn. So maybe only two or three hundred corrections are necessary, half of which would be reversing entries.

Back to the topic of standardization of names, it sure would be easier from my perspective to have set standards for certain types of names (suffixes, accents) so that such names would not have to be changed twice.

Also, regardless of whether one thinks it's easier to change names based on the Invelos lookup or to change names based on a standard, one point that I think gets lost in this debate is that the name database is a database of its own. When I look in my name database and see some names with commas and some without or some names with accents in the "Credited As" and some with accents in the Name, it looks sloppy to me.

For a consistent integrity of the names database, it would seem best to me to have standards for such names. For those who value that consistency over expediency, the ease of using the Invelos figures isn't appealing. And I would argue that there isn't more ease using the Invelos database than there would be to have name standards. I could throw a comma into a name a lot more quickly than I could determine which variant is most credited....with certainty.
...James

"People fake a lot of human interactions, but I feel like I fake them all, and I fake them very well. That’s my burden, I guess." ~ Dexter Morgan
DVD Profiler Unlimited RegistrantStar ContributorWinston Smith
Don't be discommodious
Registered: March 13, 2007
United States Posts: 21,610
Posted:
PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
Quoting hal9g:
Quote:

Where are all the "database gurus" who should understand data normalization?


This one is tired of the discussions. While normalization (standardization) is not part of this system, there are possibilities in some rare instances such as rthat noted in this very OLD poll. The use of the comma is simply because that is the more common usage and for example Elisha Cook Jr.(style) is used by another database, there fore there is no possible issue of any kind with that particular company.

Skip
ASSUME NOTHING!!!!!!
CBE, MBE, MoA and proud of it.
Outta here

Billy Video
DVD Profiler Unlimited RegistrantStar ContributorDaddy DVD
Lost in Translation
Registered: March 14, 2007
Netherlands Posts: 2,366
Posted:
PM this userVisit this user's homepageDirect link to this postReply with quote
The credit lookup is not reliable and the reliability of the most credited form in your local database depends on the number of owned titles with a questioned name. So both gives you no certainty what so ever if any name is the most credited one or not. That leaves only one solution and that's standardization. I'm using this for my local database and if I'm unable to share this with others, so be it, I lock my profile and move on to the next. Lucky for me the users of the Dutch locality are not so inflexible as others.
Martin Zuidervliet

DVD Profiler Nederlands
DVD Profiler Unlimited RegistrantStar Contributormdnitoil
Registered: March 14, 2007
United States Posts: 1,777
Posted:
PM this userView this user's DVD collectionDirect link to this postReply with quote
Quoting Martin_Zuidervliet:
Quote:
The credit lookup is not reliable and the reliability of the most credited form in your local database depends on the number of owned titles with a questioned name. So both gives you no certainty what so ever if any name is the most credited one or not. That leaves only one solution and that's standardization. I'm using this for my local database and if I'm unable to share this with others, so be it, I lock my profile and move on to the next. Lucky for me the users of the Dutch locality are not so inflexible as others.

Region 1 has it's advantages because there's a mountain of data, but that's balanced by the fact that there is a much larger audience to satisfy.

For the record, I'm not trying to come off as sour grapes or I'll just take my ball and go home.  I'm just trying to apply the mantra of "keep it local" if I don't agree with the online.  It's unfortunate, but the minute you attempt to link cast/crew, you automatically stop agreeing with the online.

I too have noticed that the common name lookup tool is fatally flawed, so I'm not about to use those results for my local database.  If I wanted results like that, I could just throw a dart in a board.
DVD Profiler Unlimited RegistrantStar ContributorAddicted2DVD
Registered: March 13, 2007
Reputation: Highest Rating
United States Posts: 17,321
Posted:
PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
I would accept standardization if Ken changes his mind and puts the exceptions into the rules... I wouldn't personally agree with it... but I would accept it.

But Ken was clear with the rules on when to use the "Credited As" field... and he clarified fine on what is meant to be used. And like he himself said... the best way to fix the problem with the credit look-up tool is to start fixing all the credits so that the correct credit is being used for all profiles. As I said... until then I will always vote no to any type of standardization.

So I still say the best thing we can do is correct the online database so that the credit look-up tool is corrected.
Pete
DVD Profiler Unlimited RegistrantStar Contributormdnitoil
Registered: March 14, 2007
United States Posts: 1,777
Posted:
PM this userView this user's DVD collectionDirect link to this postReply with quote
I hear you Pete, and when I come across something obviously wrong I will be only too happy to submit it.  That being said, since I have no control over the umpteen different region releases of the same movie with the same bad data copied across them because it's easier than redoing the profile, I'll be dead and buried before the online is actually clean.  It's even to the point where the same garbage is being copied over to re-releases because, hey, why do all that work again?  The original got accepted, didn't it? After all, it's the same movie...not like the cast changed. 

I would have a lot more faith in the system correcting itself, if when I submitted a credit change, it rippled to all profiles for that movie.  Unfortunately it doesn't work that way and I recognize there could be unforseen consequences even if it did.

Regardless, I will do my small part but remain skeptical.
 Last edited: by mdnitoil
DVD Profiler Unlimited RegistrantStar ContributorWinston Smith
Don't be discommodious
Registered: March 13, 2007
United States Posts: 21,610
Posted:
PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
I hear you midnit and I am with you. But let's face it, we are all lazy (me too) I used to copy data from somewhere with the best of them, but not any more obviously. And i am all too aware of the various forms of corruption in the database.

Skip
ASSUME NOTHING!!!!!!
CBE, MBE, MoA and proud of it.
Outta here

Billy Video
DVD Profiler Desktop and Mobile RegistrantStar ContributorAiAustria
Profiling since 2004
Registered: May 19, 2007
Reputation: Highest Rating
Austria Posts: 5,704
Posted:
PM this userDirect link to this postReply with quote
Quoting skipnet50:
Quote:
Elisha Cook, Jr.

Skip


Why?
Complete list of Common Names  •  A good point for starting with Headshots (and v11.1)
DVD Profiler Unlimited RegistrantStar ContributorWinston Smith
Don't be discommodious
Registered: March 13, 2007
United States Posts: 21,610
Posted:
PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
I have answered this many times, Ai, even this afternoon in this very thread. I don't mean to be rude, but I am not going to reepeat myself again.<frustrated>

Skip
ASSUME NOTHING!!!!!!
CBE, MBE, MoA and proud of it.
Outta here

Billy Video
DVD Profiler Desktop and Mobile RegistrantStar ContributorAiAustria
Profiling since 2004
Registered: May 19, 2007
Reputation: Highest Rating
Austria Posts: 5,704
Posted:
PM this userDirect link to this postReply with quote
Quoting skipnet50:
Quote:
I have answered this many times, Ai, even this afternoon in this very thread. I don't mean to be rude, but I am not going to reepeat myself again.<frustrated>

Skip



First of all, for me, this answer feels rather rude.

I read this thread - last not least, since a have a similar problem here in R2; but the only clarification you condenscended to leave here, was:

Quote:
This one is tired of the discussions. While normalization (standardization) is not part of this system, there are possibilities in some rare instances such as rthat noted in this very OLD poll. The use of the comma is simply because that is the more common usage and for example Elisha Cook Jr.(style) is used by another database, there fore there is no possible issue of any kind with that particular company.


With my limeted knowledge of the english language I can read two things out of this:
a) you are convinced that the comma is common usage
b) IMDB does not use a comma which hardens your opinion.

You did not give any profound reason, why the comma should be used.

My researches present a clear preference of "Elisha Cook Jr." both on the web and on the credit lookup tool (which has to be referenced to fulfill the rules).

And that's why I take you posting rather rude, since I can't see the reason for your frustration, except if the fact that not everyone shares your opinion leads up to frustration.

Regards, AA
Complete list of Common Names  •  A good point for starting with Headshots (and v11.1)
DVD Profiler Unlimited RegistrantStar ContributorWinston Smith
Don't be discommodious
Registered: March 13, 2007
United States Posts: 21,610
Posted:
PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
Ai, this is an old discussion. IMDb has made a stylistic choice, we are also well aware that our database is still chock of illegal data, as has been discussed by other users. The poll is clear on the answer by an overwhelming vote...'nuff said, I hope. We are simply making the same kind of stylistic choice that IMDb has made but ours is different. As for the comma being common usage...it is in the States, and I do have some personal knowledge on this.

Personally I have far bigger issues with the Alias system than stylistic ones. I note far too many users making assumptions that just because two names are similar then they are the same person. This is simply an assumption and blatantly untrue. There are many father son and probably mother daughter combos. We NEED to document these things. It is just a matter of time before somebody is going to uncover one these assumptions and discover that it was a false assumption, Bob may not be the same person as Robert. I know documentation at this point is not called for in the Rules, but it SHOULD bve and hopefully this WILL change at some point. But IF we are interested in accuracy then why are we making assumptions without backing them up. This is one of the reasons I am not making larger use of the Alias system at this time, there are some others as well, but at this time I am largely not prepared to do the research that is needed. So come on everybody, let's do this right, while it is still early...PLEASE.

Skip
ASSUME NOTHING!!!!!!
CBE, MBE, MoA and proud of it.
Outta here

Billy Video
 Last edited: by Winston Smith
DVD Profiler Unlimited RegistrantWhite Pongo, Jr.
No, I iz no Cheshire Cat!
Registered: August 22, 2007
Reputation: High Rating
Posts: 1,807
Posted:
PM this userDirect link to this postReply with quote
Quoting Addicted2DVD:
Quote:
I for one have no problem with something changing over time. as the database gets corrected and if the common name changes... so be it.  Then we just fix it to the new common name.


I usually have no problem with that system either, except that fixing the database to the new common name takes effort and time, so I wonder: is that effort worth it in this specific case?

Let's see.

If an actor changes name because of marriage, and is constantly credited by that name, then I can see a very good reason to fix the database.

If an actor chooses to drop his middle name, there I see another good reason for changing the data, as Joe Doe might be not the same person as Joe H. Doe, so it's important to distinguish.

If an actor is referenced by stage name, not by his real name anymore, then I see a very good reason to update the data, because that actually is a name change.

But, sorry, I don't think the same applies to a comma before the suffix. That's just punctuation, written according to a stylistic choice, as Skip put it, but doesn't really change the name.
The presence of the comma is not used to tell a person from another person, AFAIK.
Would you say that "Joe Doe, Jr." is not the same person as "Joe Doe Jr." unless documented? I doubt that. 

So, in this very specific case, I think an exception is justified and standardization would be useful.
By the way, it seems to me that something like that applies to roles: whenever we turn a plural into a singular, we are not really changing the role name, we are just applying standard grammar rules to that role name. 
-- Enry
 Last edited: by White Pongo, Jr.
DVD Profiler Unlimited RegistrantStar ContributorAddicted2DVD
Registered: March 13, 2007
Reputation: Highest Rating
United States Posts: 17,321
Posted:
PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
To me it is not now... or ever was about that... but about what the rules allows... and how Ken clarified it. Those 2 things do not allow standardizing at this time. I never said a single thing about documenting the person is who we think it is in this case... just that going by the rules and Ken's clarification we can not use any type of standardization. And that I for one will always contribute  and vote per the rules. No matter the case.

I personally believe the same way Ken said to start with... lets get the credited names correct in the database so that the look-up tool does work correctly as it is supposed to do.

Now... if Ken wants to decide to make an exception in this case... well that is totally up to him... and I will follow it just as vigorously as I follow what the rules says now. As I do with every one of his rules whether I agree with them or not.

Bottom line is standardization is not within the rules as of right now... so we can not apply them for the online database... no matter if we agree with it or not.
Pete
DVD Profiler Unlimited RegistrantStar ContributorT!M
Profiling since Dec. 2000
Registered: March 13, 2007
Reputation: Highest Rating
Netherlands Posts: 8,667
Posted:
PM this userDirect link to this postReply with quote
Quoting Addicted2DVD:
Quote:
Bottom line is standardization is not within the rules as of right now... so we can not apply them for the online database... no matter if we agree with it or not.

I really don't think of this as a form of "standardization" - like Skip indicated a few posts earlier, I prefer to think of it as a "stylistic choice", and I very much like to have that bit of consistency. I see absolutely no merit in treating one Jr. suffix differently then the other, just because the first happens to be credited WITHOUT a comma six out of ten times, and the other happens to be credited WITH a comma six out of ten times. The stylistic choice of entering suffixes in a consistent manner, of which it is obvious that a vast majority of the users wants it, and even already uses it, seems totally obvious to me...
DVD Profiler Unlimited RegistrantWhite Pongo, Jr.
No, I iz no Cheshire Cat!
Registered: August 22, 2007
Reputation: High Rating
Posts: 1,807
Posted:
PM this userDirect link to this postReply with quote
Quoting T!M:
Quote:
Quoting Addicted2DVD:
Quote:
Bottom line is standardization is not within the rules as of right now... so we can not apply them for the online database... no matter if we agree with it or not.

I really don't think of this as a form of "standardization" - like Skip indicated a few posts earlier, I prefer to think of it as a "stylistic choice", and I very much like to have that bit of consistency. I see absolutely no merit in treating one Jr. suffix differently then the other, just because the first happens to be credited WITHOUT a comma six out of ten times, and the other happens to be credited WITH a comma six out of ten times. The stylistic choice of entering suffixes in a consistent manner, of which it is obvious that a vast majority of the users wants it, and even already uses it, seems totally obvious to me...


I agree with you, but I also believe that a clarification by Ken would avoid further discussions and possibly No votes.
-- Enry
    Invelos Forums->DVD Profiler: Contribution Discussion Page: 1 2 3 4 5 6 7  Previous   Next