An American Editor

June 29, 2015

Lyonizing Word: The Easy Way, Not So Easy

by Jack Lyon

After publishing my last article, Lyonizing Word: We Can Do This the Easy Way, or . . ., Rich Adin, An American Editor, wrote:

As written, your wildcard find and replace reduces four names to three if “et al:” is the ending characteristic. How do you write it so that it can handle any number of names, say up to seven?

Good question, and a nice challenge for a wildcard search. Let’s say we have citations with strings of names like this:

Lyon J, Adin R, Carter TO, Jackson TT, Doe J, Smith K, Winger W, et al: blah blah blah

That’s seven names, but let’s see if we can make a wildcard string that will find any number of names and cut them down to three. My first impression is that this might be difficult or even impossible. But let’s try the following wildcard string:

([!^013]@, ){3}([!^013]@, ){1,}(et al:)

Here’s what that means:

Find any character except a carriage return: [!^013]
repeated any number of times: @
followed by a comma
followed by a space
and enclosed in parentheses to form a “group.”
Do that three times in a row: {3}
Find using the same group: ([!^013]@, )
if it occurs once or more (as indicated by the comma): {1,}
followed by “et al:” in parentheses to form a group.

There’s just one problem: It doesn’t work. And that’s how it often is with wildcards — sometimes you have to fiddle around to get the result you want; trial and error are key. So let’s see if we can find just three instances of text using our group:

([!^013]@, ){3}

That doesn’t work either. What in the world is going on here? Let’s try using the group three times in a row:

([!^013]@, )([!^013]@, )([!^013]@, )

That does work. So why not this?

([!^013]@, ){3}

Could it be that {3} doesn’t apply to the wildcard pattern ([!^013]@, ) itself but to the first instance of text that pattern finds? In other words, would that wildcard string  find the first three names in a citation like the following?

Lyon J, Lyon J, Lyon J, Lyon J, Lyon J, Lyon J, Lyon J, et al: blah blah blah

Sure enough, that works! So we’ve just learned something new about wildcard searches. For clarity, I’ll restate it here:

Specifying how many times to find something (using {3}, for example) doesn’t apply to the wildcard pattern it follows but to the first instance of text that pattern finds.

Unfortunately, that means we need to work out a different approach to our original problem. How about this?

([!^013]@, [!^013]@, [!^013]@, )([!^013]@, ){1,}(et al:)

Here’s what that means:

Find any character except a carriage return: [!^013]
repeated any number of times: @
followed by a comma
followed by a space
repeated three times
and enclosed in parentheses to form a “group.”
Find using the same group: ([!^013]@, )
if it occurs once or more (as indicated by the comma): {1,}
followed by “et al:” in parentheses to form a group.

But no, that doesn’t work either! Why not? Oh, yeah, because of that {1,}. As we discovered earlier:

Specifying how many times to find something (using {3}, for example) doesn’t apply to the wildcard pattern it follows but to the first instance of text that pattern finds.

Well, okay, then. We’ll stop using numbers (such as {1,}) to specify how many times a pattern should be repeated (at least for our current purposes). Let’s try this instead:

([!^013]@, [!^013]@, [!^013]@, )[!^013]@(et al:)

Here’s what that means:

Find any characters except a carriage return: [!^013]
repeated any number of times: @
followed by a comma
followed by a space
repeated three times
and enclosed in parentheses to form a “group.”
Then find any character except a carriage return: [!^013]
repeated any number of times: @
followed by “et al:” in parentheses to form a group.

Well, son of a gun; that actually works. So now we can use the following in the “Replace With” box:

\1\2

Here’s what that means:

Replace everything that was found
with the text represented by group 1: \1
followed by the text represented by group 2: \2

Group 1, you’ll remember, was this:

([!^013]@, [!^013]@, [!^013]@, )

It finds the first three names in our citations. And group 2 was this:

(et al:)

It finds the end of our citations.

And so, finally, we’ve succeeded in fulfilling Rich’s original request:

As written, your wildcard find and replace reduces four names to three if “et al:” is the ending characteristic. How do you write it so that it can handle any number of names, say up to seven?

Sometimes the easy way isn’t so easy. Nevertheless, it’s almost always worth pursuing. In Rich’s case, it reduced his editing time from hours (removing extraneous names by hand) to minutes (removing the names with a wildcard find and replace). It also gave Rich a wildcard search that he can save in his fabulous EditTools software for use with future projects. And it provided a deeper and clearer understanding of how to use wildcard searches.

After all these years of editing, wildcard searching is the tool I rely on the most. I encourage you to invest the time needed to learn to use this tool, which will repay your efforts many times over. A good place to start is my free paper “Advanced Find and Replace in Microsoft Word.”

I hope you’ll also watch for my forthcoming Wildcard Cookbook for Microsoft Word. I’m still trying to find more real-life examples for the book, so if you have some particularly sticky problems that might be solved using a wildcard search, I hope you’ll send them my way. Maybe I can save you some work and at the same time figure out solutions that will help others in the future. Thanks for your help!

Jack Lyon (editor@editorium.com) owns and operates the Editorium, which provides macros and information to help editors and publishers do mundane tasks quickly and efficiently. He is the author of Microsoft Word for Publishing Professionals and of Macro Cookbook for Microsoft Word. Both books will help you learn more about macros and how to use them.


 

Looking for a Deal?

You can buy EditTools in a package with PerfectIt and Editor’s Toolkit at a special savings of $78 off the price if bought individually. To purchase the package at the special deal price, click Editor’s Toolkit Ultimate.

May 6, 2015

Business of Editing: Cite Work Can Be Profitable

A recent “Tip of the Week” at Copyediting Newsletter, “Citing Work: What Do Editors Really Need to Do?” by Erin Brenner, discussed the problem of editing citations. As the article pointed out, “what you do to citations and how long that takes can greatly affect your bottom line.” Unfortunately, the article repeated and reinforced the shibboleth that editing citations is not (and perhaps cannot be) profitable.

As I am sure you have already guessed, I disagree.

The Problem

The problem with references is that too many authors put them together in a slapdash manner, ignoring any instructions that the publisher may have given about formatting. And Ms. Brenner is right that straightening out the author’s mess can be both a nightmare and unprofitable.

Let me step back for a moment. I want to remind you of what I consider a fundamental rule about profitability in editing: the Rule of Three, which I discussed 3 years ago in “The Business of Editing: The Rule of Three.” Basically, the rule is that profitability cannot be judged by a single project; profitability needs to be judged after you have done three projects for a client. Yes, I know that most freelancers look at a single project and declare profitability or unprofitability, but that doesn’t make it the correct measure. Anyway, the reason I raise this here is that it is true that for a particular project, having to edit and format citations can make a project unprofitable. But then so can editing the main text.

I have edited many projects over my 31 years where I wished there were more references and less text because the text was badly written but the references were pristine. References are not the automatic key to unprofitability.

Also part of the problem is not being clear what is your role as editor when it comes to the references. Copyeditors, for example, do not (should not) “fact check” references. When I have been asked to do so, I have clarified what the client really means, because I have no way of knowing if a cite actually supports the proposition to which it is attached. If the client really does mean “fact check,” which has yet to be the case, then I decline the project; I am simply not able to devote the time needed to read the cite and determine if it supports the author’s proposition and the client is not prepared to pay me to do so.

The copyeditor’s role is to conform the format of the cites to the designated style and to ensure the cite is complete. Whether the editor is supposed to complete the cite is a matter of negotiation. In my case, I limit that responsibility to a quick look at PubMed. If the cite isn’t readily found there, a quick author query is inserted and it becomes the author’s responsibility. I use EditTools’ Insert Query macro (see “The Business of Editing: The Art of the Query“) and selecting a prewritten query to insert so that a comprehensive query can be inserted within a couple of seconds. One example query is this:

AQ: (1) Please confirm that cite is correct. Unable to locate these authors with this article title on PubMed. In addition, PubMed/NLM Catalog doesn’t list a journal by this name. (2) Also, please provide the following missing information: coauthor name(s), year of publication.

It is much quicker to select a prewritten query than to write it anew each time.

One Solution

Cite work can be very profitable. As with most of editing, whether it is profitable or not often comes down to using the right tool for the job at hand.

I just finished working on a chapter (yes, a single chapter in a 130-chapter book) that is 450+ manuscript pages of which about 230 pages are citations. In fact, there are 1,827 cites for the chapter, and all the journal cites (roughly 1,800 of the references) were similar to this:

6. Jackson, S.P., W.S. Nesbitt, and E. Westein, Dynamics of platelet thrombus formation. J Thromb Haemost, 2009. 7 Suppl 1: p. 17-20.

7. Roth, G.J., Developing relationships: arterial platelet adhesion, glycoprotein Ib, and leucine-rich glycoproteins. Blood, 1991. 77(1): p. 5-19.

8. Ruggeri, Z.M., Structure and function of von Willebrand factor. Thromb Haemost, 1999. 82(2): p. 576-84.

when they needed to be like this:

6. Jackson SP, Nesbitt WS, Westein E: Dynamics of platelet thrombus formation. J Thromb Haemost 7 Suppl 1:17–20, 2009.

7. Roth GJ: Developing relationships: Arterial platelet adhesion, glycoprotein Ib, and leucine-rich glycoproteins. Blood 77(1):5–19, 1991.

8. Ruggeri ZM: Structure and function of von Willebrand factor. Thromb Haemost 82(2):576–584, 1999.

As you can see by comparing what the authors provided and what the book style was, a lot of work needed to be done to go from the before to the after. Conforming 1,800 references the standard/usual way editors do this type of work — that is, manually, period by period — could take many hours and thus be a losing proposition — or by using the right tools for the job, it could take a few hours and be a money-making proposition. I was able to conform the references in less than 4 hours and for 3.5 of those 4 hours, I was able to do other editing work while the references were being conformed.

How? By using the right tools for the job, which, in this case, was EditTools’ Wildcard Find & Replace and Journals macros, which were topics of recent essays (see “The Business of Editing: Wildcarding for Dollars” and “The Business of Editing: Journals, References, & Dollars,” respectively).

[There is an important caveat to the above: I was able to conform the references in less than 4 hours because I already had my datasets built. Over the course of time, I have encountered these problems and I have added, for example, scripts to my Wildcard dataset and journal names to my Journals dataset (which now has 78,000 entries). If I didn’t already have the scripts, or if I had fewer scripts that would address fewer problems, it would have taken me longer. But a professional editor tries to plan for the future and the key to successful use of a tool is the tool’s ability to handle current-type problems in the future.]

To clean up the author names and the cite portion (i.e., 1991. 77(1): p. 5-19) I used EditTools’ Wildcard Find & Replace Macro. Because it lets me write and save a find-and-replace string and put multiple strings together in a single “script,” with the click of a button I was able to run several dozen macros that cleaned up those items. In addition, EditTools’ Page Number Format macro let me change partial page ranges (e.g., 110-19) to full page ranges (e.g., 110-119) automatically. It took less than 15 minutes for the full reference list to be conformed and should I face a similar task next week, I already have the necessary scripts; I just need to load and run them.

What took the most time was fixing the journals. My journals dataset is currently 78,000+ entries and the Journals macro has to run through 1,827 references 78,000+ times. But what it does is fix those incorrect entries it finds in the dataset and highlights them; it also highlights (in a different color) those journal names that are correct. What that means is that I can see at a glance which cites I need to check (in this case, just a handful). And while the EditTools Journals macro is running in the background, I can continue editing other files – which means I am getting paid twice (because I charge by the page, not the hour).

Is it Profitable?

Do I earn money on this? Yes, I do. Consider this example (the numbers have no relevance to what I actually charge; they are an example only): If I charge $3 per manuscript page and the references constitute 230 pages, it means the cost to the client is $690 regardless of whether the references take me 1 hour or 50 hours. In this case, to conform the references took about 4 hours. For those 4 hours, I earned $172.50 an hour as an effective hourly rate. The reality, of course, is that I still had to look over the references and lookup a few, and I actually spent  7 hours on the references altogether, which means my effective hourly rate would be $98.57 at $3 per page. (Had I charged $25 an hour, I would have earned just $25 an hour, approximately one-quarter of the per-page rate earnings, which is why I prefer a per-page rate.) As you can calculate, at a different per-page rate, the earnings would have been higher or lower.

And that doesn’t count what I earned while continuing to edit as the Journals macro ran in the background.

My point is that using the right tools and the right resources can make a difference. I do agree that if I had to fact check each reference, I would not have made any money at a per-page rate (nor at an hourly rate because no client would pay for the time it would take to fact check 1,827 references — especially when this is only one of 130 chapters), but then I wouldn’t have done the work at such a rate (or at all). Whether a task is profitable depends on many factors.

The notion that editing references cannot be profitable is no more true than is the notion that editing text is always profitable. Editing references may not be stimulating work, but with the right tools it can be profitable. The key to profitable editing, is to use the right tool for the job.

Richard Adin, An American Editor

_____________

Looking for a Deal?

You can buy EditTools in a package with PerfectIt and Editor’s Toolkit at a special savings of $78 off the price if bought individually. To purchase the package at the special deal price, click Editor’s Toolkit Ultimate.

Blog at WordPress.com.