Welcome to the Invelos forums. Please read the forum rules before posting.

Read access to our public forums is open to everyone. To post messages, a free registration is required.

If you have an Invelos account, sign in to post.

    Invelos Forums->DVD Profiler: Contribution Discussion Page: 1 2 3 ...6  Previous   Next
New Actor/Crew Linking System
Author Message
DVD Profiler Desktop and Mobile RegistrantStar Contributorhal9g
Who is John Galt?
Registered: March 13, 2007
Reputation: High Rating
United States Posts: 6,635
Posted:
PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
MODERATORS:  I deliberately did not put this in the "New Features" forum, so that everyone would have a chance to vote.  If you must, feel free to move it.

At the suggestion of Taro and supported by several other users, I am starting this poll to gauge the community's desire for a new actor/crew linking system as described recently in another thread.

Here's how I see it possibly working:

Eliminate the "As Credited" function of the program as well as the CLT.  All credits would be entered WYSIWYT.

Ken would create a "Linking" table which would contain all variations of a person's name.  This table could be initially populated by Ken by extracting the current "As Credited" data from the main database.  This means that all the work done over the past couple of years trying to link names using "As Credited" would be preserved and we would be starting this system with a significant number of names already linked.  This table would become part of the download files whenever you "Update" your local database.

Ken would have to modify the code for the local application such that when you double-click on a name in the cast or crew section, the program would find the exact name that you clicked on in the table (referenced above) and list not just the profiles with an exact match, but profiles with matches to all name variants listed in the table for that name.

The program would link all instances within your local database automatically without making you manually go to each profile and update the credit to include "As Credited" data.

For new links, users would submit "Link Requests" through a separate contribution process.  These requests would require sufficient supporting evidence that the two linked names are in fact the same person.  These "contributions" would be voted on the same way as profile contributions today, and ultimately reviewed by the screeners and approved/rejected as appropriate.

We all know that the current main database is riddled with IMDb data.  A simple linking system wouldn't care if the actual credits in the main database are "As Credited" or not.  It would only need to know that credit A equals credit B equals credit C, therefore, a "cleanup" of the main database is not required to make this system work immediately.

In all fairness, I need to point out one downside of this system.  Take for instance "Al Smithee".  Today, with as credited, "Al Smithee" has been identified to be an alias for more than one person (which is in fact the case) and only the profiles that are associated with the same person will get linked.  In the simple linking system, this cannot be done, and all people associated with the name of "Al Smithee" will get linked together.

Personally, I believe this scenario is so infrequent, that the benefits of a simple linking system far outweigh this one small flaw.  And knowing this community, it is entirely possible that someone here can come up with a solution to that problem, too.

To really put a cap on this, if we collapse the name fields to a single name field, we'd have a near perfect system that would eliminate 75% plus of the arguments in the forum. (I think that's a good thing????).

Please feel free to shoot this full of holes!!! 
Hal
 Last edited: by hal9g
DVD Profiler Unlimited RegistrantStar ContributorWinston Smith
Don't be discommodious
Registered: March 13, 2007
United States Posts: 21,610
Posted:
PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
I've said this for two years, now...it's time for my nap.      

The larger question realtive to simple association is as follows. It is largely a local issue, ideally we would have some form of data sharing...HOW. Secondly, how do we deal with Documentation,I think the current concept using results that are simply based on using the Name variant Tool is to loosey-goosey and ripe for all sorts of erroneous data. We won't need the CLT  but we should absoluteluy have some sort of requirement of documentation to support that NameA=NameB, speaking personally I would anticippate at least two independent references, unless the person has a personal website that can shed light on it. The biggest danger here is with lesser known actors and Crew data, but to take an obvious one, we don't want to see a situation where Alan Hale=Alan Hale, Jr, ignoring  that Alan Hale, Sr. also went by Alan Hale. While this isn't likely for Jr and Sr. the point is that such a thing could easily happen in lesser known actors and Crew data.

Skip
ASSUME NOTHING!!!!!!
CBE, MBE, MoA and proud of it.
Outta here

Billy Video
 Last edited: by Winston Smith
DVD Profiler Unlimited RegistrantStar ContributorArdos
Registered: July 31, 2008
Reputation: High Rating
United Kingdom Posts: 2,506
Posted:
PM this userDirect link to this postReply with quote
I think it's a good suggestion in theory. I've just got a couple of "issues".

Quote:
Ken would create a "Linking" table which would contain all variations of a person's name.  This table could be initially populated by Ken by extracting the current "As Credited" data from the main database.  This means that all the work done over the past couple of years trying to link names using "As Credited" would be preserved and we would be starting this system with a significant number of names already linked.  This table would become part of the download files whenever you "Update" your local database.


Although I can fully understand why you suggest doing that, I think that I would prefer the harder, longer approach and work from scratch for linking. This would remove any and all incorrectly linked entries which we "all" know exist in the database. I think there are plenty of users here who wouldn't mind helping with that. Perhaps those who participate in the beta could start to populate the new system while those who want to wait can. Of course that would mean Ken would need to run two systems for a while a beta & 3.5.1 system.

On reflection, if a change was put in place, regardless of how, we'd need to have two systems for a while to allow users some time to change over.

Quote:
To really put a cap on this, if we collapse the name fields to a single name field, we'd have a near perfect system that would eliminate 75% plus of the arguments in the forum. (I think that's a good thing????).


Again I can see the reasoning but I don't want to have everyone sorted by their first name. To me that just sounds so wrong. Of course if you couple that with the suggestion that's previously been made for a LOCAL sorting field then I'd go with it. I can understand though that some wouldn't want another local only field.
 Last edited: by Ardos
DVD Profiler Desktop and Mobile RegistrantStar Contributorhal9g
Who is John Galt?
Registered: March 13, 2007
Reputation: High Rating
United States Posts: 6,635
Posted:
PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
Quoting Forget_the_Rest:
Quote:
I think it's a good suggestion in theory. I've just got a couple of "issues".

Quoting hal9g:
Quote:
Ken would create a "Linking" table which would contain all variations of a person's name.  This table could be initially populated by Ken by extracting the current "As Credited" data from the main database.  This means that all the work done over the past couple of years trying to link names using "As Credited" would be preserved and we would be starting this system with a significant number of names already linked.  This table would become part of the download files whenever you "Update" your local database.


Although I can fully understand why you suggest doing that, I think that I would prefer the harder, longer approach and work from scratch for linking. This would remove any and all incorrectly linked entries which we "all" know exist in the database. I think there are plenty of users here who wouldn't mind helping with that. Perhaps those who participate in the beta could start to populate the new system while those who want to wait can. Of course that would mean Ken would need to run two systems for a while a beta & 3.5.1 system.


That's a very good point.  Perhaps Ken could extract the data into a table and have a group of "volunteers" validate it during the beta period.

Quoting Forget_the_Rest:
Quote:

Quoting hal9g:
Quote:
To really put a cap on this, if we collapse the name fields to a single name field, we'd have a near perfect system that would eliminate 75% plus of the arguments in the forum. (I think that's a good thing????).


Again I can see the reasoning but I don't want to have everyone sorted by their first name. To me that just sounds so wrong. Of course if you couple that with the suggestion that's previously been made for a LOCAL sorting field then I'd go with it.


I would fully support a local sorting field.
Hal
DVD Profiler Unlimited RegistrantStar ContributorWinston Smith
Don't be discommodious
Registered: March 13, 2007
United States Posts: 21,610
Posted:
PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
Good points Hal and forget.

Skip
ASSUME NOTHING!!!!!!
CBE, MBE, MoA and proud of it.
Outta here

Billy Video
DVD Profiler Unlimited RegistrantStar ContributorAddicted2DVD
Registered: March 13, 2007
Reputation: Highest Rating
United States Posts: 17,329
Posted:
PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
It has my vote Hal! 
Pete
DVD Profiler Desktop and Mobile RegistrantGraveworm
Registered: April 7, 2007
United Kingdom Posts: 357
Posted:
PM this userView this user's DVD collectionDirect link to this postReply with quote
Works for me. I'd like to see this form the basis of a Database of cast and crew but this is an improvement over what we have. The only problem is bandwidth as the table woudl need to be present locally and be regularly updated. Not an issue for us and the desktop version but probably for Ken and depending on the size the windows mobile version. Plus of course it would need a re-write of all versions at the same time to make it compatible
which must be headache for the authors and for very little extra return unless they start charging for upgrades, which I would pay, but with all the issues arouind that.
 Last edited: by Graveworm
DVD Profiler Desktop and Mobile RegistrantStar ContributorTaro
Registered: February 23, 2009
Reputation: High Rating
Belgium Posts: 1,580
Posted:
PM this userView this user's DVD collectionDirect link to this postReply with quote
I'm in support for trying out this new system. The current one, even if it would be cleaned up, would still not link correctly due to errors in movie credits and random romanization methods for non-English movies.

I believe the system as described by Hal is the next evolutionary step forward to give the online database even more appeal. Now a lot of users have to do the work twice:
  • input the data with the WYSIWYT system and submit online

  • adapt the local data to correct missing links


  • The system Hal describes fully preserves the current WYSIWYT but adds the correct linking to that. The linking needs only be done once and all users can download it from the online. No more linking correcting individually in the local database.

    It would need some fine tuning and beta testing I think, but it's definitely worth to try, because in my humble opinion, the current linking systems doesn't work (sufficiently) to be an effective tool.

    I'll also add that I would also be willing to pay for such an upgrade, provided the charges aren't exhorbitant of course.
    Blu-ray collection
    DVD collection
    My Games
    My Trophies
     Last edited: by Taro
    DVD Profiler Unlimited RegistrantStar ContributorDanae Cassandra
    Registered: Apr 11, 2004
    Registered: May 26, 2007
    Reputation: Great Rating
    United States Posts: 2,877
    Posted:
    PM this userView this user's DVD collectionDirect link to this postReply with quote
    I like the idea Hal.  It will certainly show an improvement over the system as it currently stands.  I like that it preserves the credits and that it also may provide mostly correct linking.  Like Taro, I'd be all for trying it out - and I'd be willing to pay for the upgrades, also if it wasn't too expensive.

    I just have one question:  How will your proposal handle credits for different people with the same name?

    For example, there are several people named Michael Davis (by my count 11 different crew people and 16 different cast people).  Currently to differentiate any of them from each other I need to find birthdays.  How will your proposal differentiate them?

    (Hopefully this isn't the same as the Alan Smithee problem you have already pointed out.)
    If more of us valued food and cheer and song above hoarded gold, it would be a merrier world.
    -- Thorin Oakenshield
     Last edited: by Danae Cassandra
    DVD Profiler Desktop and Mobile RegistrantStar Contributorhal9g
    Who is John Galt?
    Registered: March 13, 2007
    Reputation: High Rating
    United States Posts: 6,635
    Posted:
    PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
    Quoting Danae Cassandra:
    Quote:


    I just have one question:  How will your proposal handle credits for different people with the same name?

    For example, there are several people named Michael Davis (by my count 11 different crew people and 16 different cast people).  Currently to differentiate any of them from each other I need to find birthdays.  How will your proposal differentiate them?

    (Hopefully this isn't the same as the Alan Smithee problem you have already pointed out.)


    Good question.  The BY concept was supposed to deal with this problem, but as we've found out, finding BY for many is all but impossible, especially crew members.

    If they aren't differentiated, then it will link all profiles for that name, something we need to avoid.

    I don't have a good answer for that one.  Maybe someone else has some thoughts.
    Hal
     Last edited: by hal9g
    DVD Profiler Desktop and Mobile Registrantleo1963
    Registered: May 14, 2007
    United States Posts: 455
    Posted:
    PM this userView this user's DVD collectionDirect link to this postReply with quote
    Since I do a lot of crew/cast contributions I would be in favor of a system like Hal has suggested. 

             

       
    DVD Profiler Unlimited RegistrantStar ContributorWinston Smith
    Don't be discommodious
    Registered: March 13, 2007
    United States Posts: 21,610
    Posted:
    PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
    Hal:

    First question, especially ASkipsince the BY was your idea  and i still think a brilliant one...too bad it didn't work as planned. Should we continue using BY when possible or just lose it altogether. Let's see  we can't use Roman Numerals.

    If we continue using BY data, then there would be times when we can't find a BY but can find a DY...sounds too complex. Plus what if the person isn't dead.

    Assign an alpha "number" A, B, C etc, an alpha numeric or an arabic numeral. Who sets such a system in any of the three and on what bases.

    Pardon me, I am just playing with train of thought, please feel free to offer thoughts on any of these.

    Skip
    ASSUME NOTHING!!!!!!
    CBE, MBE, MoA and proud of it.
    Outta here

    Billy Video
    DVD Profiler Unlimited Registrantnuoyaxin
    prev. known as ya_shin
    Registered: March 13, 2007
    Reputation: High Rating
    Taiwan, Province of China Posts: 3,433
    Posted:
    PM this userEmail this userVisit this user's homepageView this user's DVD collectionDirect link to this postReply with quote
    The one thing that has me a bit concerned is actually reflected by the way one of the options is formulated: "Let's try the simple linking system described below." (bold by me).

    Ken once decided to go in the Credited As direction and he explained that that decision wasn't made lightly. This seems a bit like "oh, we don't like this, let's try something else to check if that works better". And if it doesn't we just try again...?


    BTW, I didn't vote against it either, I am one of those with a desire to nap.
    Achim [諾亞信; Ya-Shin//Nuo], a German in Taiwan.
    Registered: May 29, 2000 (at InterVocative)
    DVD Profiler Unlimited RegistrantStar ContributorDanae Cassandra
    Registered: Apr 11, 2004
    Registered: May 26, 2007
    Reputation: Great Rating
    United States Posts: 2,877
    Posted:
    PM this userView this user's DVD collectionDirect link to this postReply with quote
    If we wanted to keep birth year, I suppose we could use fake birth years when we did not have the true year, using them as placeholders for a time when the true birth year could be ascertained. 

    So we could have:
    Michael Davis (1935)
    Michael Davis (1961)
    Michael Davis (0001)
    Michael Davis (0002)
    etc...

    The only problem would arise at that point is when we get:
    Michael Davis (1961)
    Michael Davis (1961)

    But that is a problem with the current system that has no solution.
    If more of us valued food and cheer and song above hoarded gold, it would be a merrier world.
    -- Thorin Oakenshield
    DVD Profiler Desktop and Mobile RegistrantStar Contributorm.cellophane
    tonight's the night...
    Registered: March 13, 2007
    Reputation: High Rating
    United States Posts: 3,480
    Posted:
    PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
    With the current system in my local db, I can link or separate any credit, whether it's Alan Smithee or the Alan Hales or Harry Stradlings, the screenwriter Arthur Miller and the cinematographer Arthur Miller, etc. Unless this new suggested system could provide that same level of distinction, I wouldn't want to change.
    ...James

    "People fake a lot of human interactions, but I feel like I fake them all, and I fake them very well. That’s my burden, I guess." ~ Dexter Morgan
    DVD Profiler Unlimited RegistrantStar ContributorWinston Smith
    Don't be discommodious
    Registered: March 13, 2007
    United States Posts: 21,610
    Posted:
    PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
    Quoting Danae Cassandra:
    Quote:
    If we wanted to keep birth year, I suppose we could use fake birth years when we did not have the true year, using them as placeholders for a time when the true birth year could be ascertained. 

    So we could have:
    Michael Davis (1935)
    Michael Davis (1961)
    Michael Davis (0001)
    Michael Davis (0002)
    etc...

    The only problem would arise at that point is when we get:
    [b]Michael Davis (1961)
    Michael Davis (1961)[/b]

    But that is a problem with the current system that has no solution.

    Cass:

    Is this REAL of hypothetical. If it's real that fix was anticipated but will have to be included in a new version, we thought the odds were low that such would be found in reality. The fix would be to add BM.

    SKip
    ASSUME NOTHING!!!!!!
    CBE, MBE, MoA and proud of it.
    Outta here

    Billy Video
        Invelos Forums->DVD Profiler: Contribution Discussion Page: 1 2 3 ...6  Previous   Next