Closed richard-jones closed 10 years ago
Some questions for the data model:
Society - is this the same as "provider"? If so, can we continue to use the term "provider" or should we migrate the whole dataset over to "society"?
DM: no. provider is now platform/host/aggregtor. See below. Society is a new field.
Platform/Host/Aggregator - I'm still not totally clear what this is. One of the examples is "Highwire Press"; so this is if there is a service provider who hosts the journal for you? Would "OJS" be another value you might enter here?
DM: yes, OJS would work here. For example: the journal 'Heart', owned by the British Cardiac Society (society), is published by BMJ Publishing Group (publisher) and made available online via HighWire Press (platform).
APCs - at the moment we have author_pays, which is not boolean (unlike the new form); it can contain yes, no, conditional and no information. Should we have a new field for APC, and do we have to migrate author_pays?
DM: retain as Yes/No, as it is in the new form. Ignore/delete all existing data for conditional and no info.
There is a question relating to the BOAI, and then a question about the nature of the license. We could reduce this to one question ("which licence are you using") and infer the BOAI compliance from that. There is space in the data model already for this question, so that's good.
DM: unfortunately not. There are places in the world where CC licenses aren't valid and we have many publishers who don't user CC licenses. So we need to retain both.
There is a question in the form example numbered 41 which has a year placeholder XXXX - what is that supposed to indicate? Is it a fixed year you will decide, or is it dynamic? This question is quite complex, and may benefit from being broken down into two answers: author retains copyright? author retains publishing rights (or similar)
DM: please replace XXXX with 2015. It was a placeholder. I have now split this into two questions.
And some more questions:
DM: no URLs needed.
DM: yes, as above, many publishers do not apply licenses.
DM: have changed the wording on the site and made the URL non-compulsory.
DM: OK but as per above. Not all journals uses licences so we need to try to capture as many angles of this as possible.
DM: this is correct. CL should ignore/delete all existing data in these fields.
DM: are CL able to tell me how many of the journals have a value for this field?
DM: correct
DM: correct
DM: correct
DM: correct
I have created a spreadsheet which starts to map the form to the data model, and also has some notes about how we can aid the user input:
Have updated your comments with our responses. One question for CL: Alternative title is not included in the suggest form DM: are CL able to tell me how many of the journals have a value for this field?
One question for CL: Alternative title is not included in the suggest form DM: are CL able to tell me how many of the journals have a value for this field?
Sorry, didn't see this.
9609 out of 12459 journals are missing an alternative title or have an empty one.
Is there standard terminology for article submission charges? DM: no, there isn't. Lars suggested just submission charges but in your data model you could probably use whatever you wanted.
Write out some scenarios for questions 39 and 41. DM: 1) an author submits a paper to an open access journal. The journal has a CC-BY license for all content. When the paper is accepted for publication, the publisher asks the author to sign a form transferring all copyright and reproduction rights over to the publisher indefinitely. In this scenario, the author might not choose to send her article to this publisher as any activity by the author after publication would be limited. 2) an author submits a paper to an open access journal. The journal has a CC-BY license for all content. When the paper is accepted for publication, the publisher asks the author to sign a form that grants the publisher the right to reproduce the work for one year but all copyright and permissions stay with the author. This is definitely better for the author in this scenario.
Is it confusing that these 3 are lumped together? Should there be subheadings? DM: I have made changes to the form to reflect the different areas.
Do we need alternative title? DM: yes and I have added it into the form.
Due to these changes, the form numbering is now out of order. Please ignore it. I will correct it if you want me to.
For the record, on implementing this form. The questions "From 2015 ..." should all be this year + 1, so next year it becomes "From 2016...." automatically
Further comments from my call with Lars: 3) Will data from the article count (question 13) be something that will be part of a journal's published metadata? DM: yes and will be updated every year. Preferably by the publisher. and then approved by the EdTeam.
4) Shall we include CC0, CC-BY-SA as options for license? DM: we will not include CC0 but CC-BY-SA should be included. I have updated the form already but see below... Lars liked your idea of adding the version number but said that it should not be a compulsory question because many publishers may not know which version of the CC license they have. Does that make sense or is this an unnecessary worry?
5) Confirm question 39 with Lars, especially wording "in accordance with". (Currently, CL will be displaying that if CC-BY selected, CC-BY displayed against journal, even if they have something LIKE a CC-BY.). DM: we take onboard of all CL's feedback here and wondered if the following solution might be possible:
/start/ Does your journal use a CC license or an equivalent thereof? If you do not apply a CC license to your content but have something similar, please go to http://creativecommons.org/choose/ and find out which CC license matches your journal. Left column Right column OR CC-BY CC-BY equivalent CC-BY-SA CC-BY-SA equivalent CC-BY-NC etc etc CC-BY-ND /end/
Then on the site, each journal would display the value of the radio button selected.
Could CL go over their idea again for having publishers choose attributes?
Finally:
For the record, on implementing this form. The questions "From 2015 ..." should all be this year + 1, so next year it becomes "From 2016...." automatically
Actually, we followed CLs advice and have removed this from the question altogether.
Question: for publishers who have many journals, will bulk upload via a spreadsheet upload option be possible?
Sorry, just to clarify: bulk creation of suggestions via a spreadsheet (with ~50 columns)?
Yeh, I was thinking of a way that the publisher might be able to provide a csv file with the info that we could ingest straight in to the database, so they didn't have to use the form 150 times
I have opened a new issue #106 to cover this possibility.
Some comments on the above further feedback
3) Will data from the article count (question 13) be something that will be part of a journal's published metadata? DM: yes and will be updated every year. Preferably by the publisher. and then approved by the EdTeam.
RJ: I think we're talking about slightly different things, so need to clarify. The form says "do you supply article metadata", which I have taken to be a flag (true or false) as to whether the publisher makes stats available. But are you also saying that you want to capture those statistics?
/start/ Does your journal use a CC license or an equivalent thereof? If you do not apply a CC license to your content but have something similar, please go to http://creativecommons.org/choose/ and find out which CC license matches your journal. Left column Right column OR CC-BY CC-BY equivalent CC-BY-SA CC-BY-SA equivalent CC-BY-NC etc etc CC-BY-ND /end/
The things to take into account here are:
I'm still not clear, so we should go through this again on Monday.
DM: Could CL go over their idea again for having publishers choose attributes?
I was thinking something along the lines of:
Choose your licence:
if "Other", does it require:
Attribution [ ] Share Alike [ ] No Commercial Usage [ ] No Derivatives [ ]
From a datamodel point of view, this separates the name of the licence from the type of rights it provides, which will allow us to show in the discovery interface, for example, journals under BY licences whether they are C-BY or some home-grown version.
Take a look at the datamodel for OAG, to see how we would represent this: http://oag.cottagelabs.com/developers/api
RJ: I think we're talking about slightly different things, so need to clarify. The form says "do you supply article metadata", which I have taken to be a flag (true or false) as to whether the publisher makes stats available. But are you also saying that you want to capture those statistics?
DM: yes
what are we actually going to store in the database license value, eg 'cc-by' or 'equiv cc-by' what are we going to present to the end user in the discovery interface license type, eg 'cc-by' or 'equivalent to cc-by'
But yes, let's discuss this and your attribute idea on Monday.
@dommitchell the prototype form has a (single) question
3) ISSN or eISSN *
Shall we make this 2 fields which ask for ISSN and eISSN separately (like in the current edit journal form), or do you only want publishers to give you one of the identifiers? Not much work (for them) to give you both so I'd suggest that.
We also have the optional validation (i.e. one must be present, but it could be either of them. Both missing fails validation though.).
I'm under the impression that this form should be behind a login (i.e. you need an account to suggest a journal). I can't see where this has been recorded and it's not in this issue, so I'm recording it here - correct me if I'm wrong.
We don't know yet - there is currently no way for a publisher to request or create an account, so we need to determine that workflow. The old workflow saw the admins provide the account after the suggestion was made.
On 22 February 2014 17:14, Emanuil Tolev notifications@github.com wrote:
I'm under the impression that this form should be behind a login (i.e. you need an account to suggest a journal). I can't see where this has been recorded and it's not in this issue, so I'm recording it here - correct me if I'm wrong.
— Reply to this email directly or view it on GitHubhttps://github.com/DOAJ/doaj/issues/81#issuecomment-35808050 .
Richard Jones,
Founder, Cottage Labs t: @richard_d_jones, @cottagelabs w: http://cottagelabs.com
Correction: SemperTool created the accounts upon a suggestion being approved for the DOAJ
yes, please make it 2 fields so where possible, publishers can enter both.
On 21 February 2014 at 19:04 Emanuil Tolev notifications@github.com wrote:
@dommitchell https://github.com/dommitchell the prototype form has a (single) question
> > 3) ISSN or eISSN *
Shall we make this 2 fields which ask for ISSN and eISSN separately (like in the current edit journal form), or do you only want publishers to give you one of the identifiers? Not much work (for them) to give you both so I'd suggest that. We also have the optional validation (i.e. one must be present, but it could be either of them. Both missing fails validation though.). — Reply to this email directly or view it on GitHub https://github.com/DOAJ/doaj/issues/81#issuecomment-35756213 .
I was thinking something along the lines of:
Choose your licence:
CC-BY CC-BY-SA CC-BY-NC Other:
if "Other", does it require:
Attribution [ ] Share Alike [ ] No Commercial Usage [ ] No Derivatives [ ]
From a datamodel point of view, this separates the name of the licence from the type of rights it provides, which will allow us to show in the discovery interface, for example, journals under BY licences whether they are C-BY or some home-grown version.
Take a look at the datamodel for OAG, to see how we would represent this: http://oag.cottagelabs.com/developers/api
We are happy with this approach and would like it to be implemented.
Some notes on the new form. (We realise that what is there now is not final. These are only reminders for me and you.)
CC-Zero not to be added.
@dommitchell : review list of licenses that will go on the form as choices
Languages should be controlled vocabulary according to https://docs.google.com/spreadsheet/ccc?key=0AsW3CmDjW93kdGlRU21FNFZSRHhPaUk0ZmtUUHdGNnc&usp=drive_web#gid=0 so a list of all languages is needed.
@emanuil-tolev Record where you took country-codes.json from in the README
@dommitchell : review list of licenses that will go on the form as choices
Please add CC-BY-NC-ND.
Please note the text of questions 19 and 24 has altered slghtly: https://docs.google.com/forms/d/1u0F-fVbEHySkTRXvJLkfd1s_ace9-nTzRYIWdJwSR68/viewform
Please add CC-BY-NC-ND.
Done (not on test site yet, just a note that it's been done)
"Does the journal embed machine-readable CC licensing information in its article metadata?" Please provide an example. For more information go to http://wiki.creativecommons.org/Marking_works
Is the google form missing a field for the example here? Do you want the user to input a URL into an additional text field below this question if they say yes?
(sorry @dommitchell for the duplicate email you'd get - I realised this should have been recorded in github, not in the least since I'll need to reread this whole issue when reviewing the form)
I assume we still want a Captcha at the end of the form. Though it will be mighty annoying after such a long form :).
EDIT: just to clarify people won't lose any data if they fill in the wrong captcha, all they'd have to do is fill it in again and hit submit.
I assume we still want a Captcha at the end of the form. Though it will be mighty annoying after such a long form :).
Yes, please although if you think it is unnecessary, please say!
Is the google form missing a field for the example here? Do you want the user to input a URL into an additional text field below this question if they say yes?
Yes please! If yes, *Please provide an example:'
Yes, please although if you think it is unnecessary, please say!
The big problem is that it's going to time out by the time they reach the bottom of the page. So if we use it, users must click refresh on it before entering it, otherwise it's guaranteed to be wrong.
I'm currently leaning towards a form without a captcha - I'm not sure that bots will be able to submit this form (or want to). The problem is generic bots which roam the internet, not anything targeted at the DOAJ specifically. If we do start getting junk submissions it's probably worth either:
a. using Google ReCAPTCHA with suitable instructions to refresh it (or even refresh it automatically for users, though I doubt this is easy to achieve) b. actually looking about for alternatives - believe me I've done this but there's just nothing that's less than a day of work at least to integrate and test properly. Captcha verification is definitely not a solved problem it would seem...
So for now I'd go for not having it and see if anything bad happens. If so, clean up the mess and put it on temporarily, then pick one of the options above.
Agreed , please proceed without it
(I'll edit this comment rather than make new ones so come to the issue on github for the latest info. This is a list for myself to begin with, but we'll review it on Monday as part of releasing the new form.)
Some comments on the text around the form, taken from the prototype at https://docs.google.com/a/cottagelabs.com/forms/d/1u0F-fVbEHySkTRXvJLkfd1s_ace9-nTzRYIWdJwSR68/viewform :
All questions, except two, are mandatory.
Not sure if this is the case any more, just need to review.
A note about the qualifiers for the DOAJ Seal
Do we still want this section included at all?
(Question 17)
Er, the questions aren't numbered. They could be, but updating the form like that will be painful (have to change all references when adding / removing a question). That's not a great idea considering this form may be modified due to feedback very soon. Something which calculates the number of the question and inserts it dynamically into the text is better, as long as it does not take too long to develop.
What was the first calendar year in which a complete volume of the journal provided online Open Access content to the Full Text of all articles (Full Text may be provided as PDFs
Does this year have to be in the past? Currently max number allowed is current year + 1 (accommodate submissions around New Year's Eve I guess - your call though)
All questions, except two, are mandatory.
Not sure if this is the case any more, just need to review.
DM: no longer needed. Please delete.
A note about the qualifiers for the DOAJ Seal
Do we still want this section included at all?
DM: yes.
(Question 17)
Er, the questions aren't numbered. They could be, but updating the form like that will be painful (have to change all references when adding / removing a question). That's not a great idea considering this form may be modified due to feedback very soon. Something which calculates the number of the question and inserts it dynamically into the text is better, as long as it does not take too long to develop.
DM: yes, I agree. If you could do that, it would be great! We definitely need the explanations there. If this can't be done before rollout, can we add targets in the text instead? e.g
1) have [an archival arrangement in place](hyperlinked to the question) with an external party. If 'None' is selected, the journal will not qualify for the Seal. etc
What was the first calendar year in which a complete volume of the journal provided online Open Access content to the Full Text of all articles (Full Text may be provided as PDFs
Does this year have to be in the past? Currently max number allowed is current year + 1 (accommodate submissions around New Year's Eve I guess - your call though)
DM: no, it can be the current year (for new journals or journals that are newly OA)
Is the license URL absolutely required when the license picked is one of the CC options? What about the "Other" option? I'm guessing it's optional on "No"
DM: no we decided to leave this optional.
If this can't be done before rollout, can we add targets in the text instead? e.g
1) have an archival arrangement in place with an external party. If 'None' is selected, the journal will not qualify for the Seal. etc
Yeah, probably wouldn't try to do it today. And yes, targets will work, all the questions have one - good idea.
DM: no, it can be the current year (for new journals or journals that are newly OA)
Do we allow next year at all then? Like now, should you be able to type in 2015?
Feedback from call with Lars: 1) Archiving, Article identifiers and Deposit policy - not urgent, but a suggestion would be to make the things clickable - that is the have a hyperlink below LOCKSS, CLOCKSS etc - especially the more exotic handle services and deposit policy services are not well known - this should by no means prolong the implementation of the new form
2) Question numbering: we will have to have the questions numbered. DOAJ Team will be communicating to publishers re problems with the form AND we will be translating the form in Help docs so question numbering facilitates that communication. The hyperlinked targets can stay!
3) 'Additionally I noticed that Article Identifiers should behave like the digital archiving policy, i.e. grey checkboxes out, "None" must be the only one selected.' DM: And move "None" to the top. 'Are there any other fields which need to behave like this actually? deposit policy? DM: yes actually. 'With which deposit policy directory does the journal have a registered deposit policy?' And move None to the top.
Additionally, I've put up a new 2-column layout with the signature rounded orange boxes surrounding the sections: http://doaj.cottagelabs.com/application/new . Thoughts?
A couple of the headings (Basic journal information, Information about you) looked really strange when left-aligned, so I've aligned them (just them 2) with the where the text fields start.
If you resize your browser you'll see that the sections stack on top of each other in the correct order. (for smaller screens)
Yes, we like!
Do we allow next year at all then? Like now, should you be able to type in 2015?
No. Just limit to current year.
1) Archiving, Article identifiers and Deposit policy - not urgent, but a suggestion would be to make the things clickable - that is the have a hyperlink below LOCKSS, CLOCKSS etc - especially the more exotic handle services and deposit policy services are not well known - this should by no means prolong the implementation of the new form
OK, sure, though might take an hour or two (a. making the text of a checkbox a link; b. finding a maintainable way of defining those links in the code, we do have 57 questions and lots of checkboxes).
Probably best done with a list of items that you would like hyperlinked, and the relevant URL-s - you know far more than me which ones need a link.
Also, the links can be the standard browser blue or (similar to question numbering below) the DOAJ orange, if needed. I think it's also probably a good idea to open these links in a new tab :) considering you'd otherwise leave the huge form.
2) Question numbering: we will have to have the questions numbered. DOAJ Team will be communicating to publishers re problems with the form AND we will be translating the form in Help docs so question numbering facilitates that communication. The hyperlinked targets can stay!
Actually there was a compounded problem with the 2-column layout - the layout's very nice, but now you've got 2 questions next to each other on the screen. So you can't really tell what to look at when you follow some link to #deposit_policy since there's so much on the screen - it's gonna be at the top of your window but is it on the left or the right?
I thought we should really have a good solution to this, also because you raised the question numbers issue. There should be a GOOD way of linking to individual questions as that will result in much clearer feedback and less frustration. So
What should happen is: at 3. the page should scroll to the question (not snap to the question - scroll visibly so your eyes can follow). Then the question should get highlighted. at 5. the page will open at the position of the question, but it should get highlighted.
Additionally I've added the ability to refer to questions by their numbers. So http://doaj.cottagelabs.com/application/new#25 should work. This doesn't activate the fancy stuff (so it will just snap and not highlight), a. because it's a bit difficult to infer the name of a question from its number at the moment b. because presumably you know what you're doing if you use that link. There's only 2 ways to get it: read the source code of the page (I would do this to random websites) OR you just know about it.
It's obvious what you're looking for anyway, the numbers are on the screen...
3) 'Additionally I noticed that Article Identifiers should behave like the digital archiving policy, i.e. grey checkboxes out, "None" must be the only one selected.' DM: And move "None" to the top.
OK, done.
'Are there any other fields which need to behave like this actually? deposit policy? DM: yes actually. 'With which deposit policy directory does the journal have a registered deposit policy?' And move None to the top.
Done.
As an update: there's a bit more work on the crosswalk as a result of the work on making questions referanceable + numbered. I will make it live tomorrow morning (10-11 UK time) unless there's any other feedback on the front-end.
Of course I'll send an email around to everybody when I do that.
Just to say thanks for the numbering solution! It's great!
Assigning to Emanuil
Some updates on what has been developed between the comments above and now.
Probably best to open new issues about new enhancements to the form or user feedback. I do note we have 2 suggestions on the live site as of now (~2 hours live) so it is at least working :).
Issue 132 opened: https://github.com/DOAJ/doaj/issues/132
New Suggestion Form
Note that the Suggestion form has implications for the data model in DOAJ, and while we will update the data model to handle the additional content, this work item does not include propagating those changes to the public-facing user interface(s).