‘The sincerest form of flattery’: How a math professor discovered his work had been plagiarized

Andras Kornai

Not long ago, it came to my attention that a 2016 paper by my students and me, “Measuring Semantic Similarity Of Words Using Concept Networks,”  had been plagiarized, verbatim. The offenders had added two words to the title, which now read: “A Novel Methodology For Measuring Semantic Similarity Of Words Using Concept Networks.” Their article was published in the journal Webology, which has been delisted from Scopus, Elsevier’s abstract and citation database. My first impulse was to ignore the transgression, but I asked the question what to do on a closed mailing list read by former colleagues:

I know that imitation is the sincerest form of flattery, and I’m sufficiently flattered, and I know that even a Harvard President was let go for lesser forms of plagiarism, but Integral University of Lucknow is not exactly Harvard. We may already live in a post-truth world (if Trump gets reelected it’s proof positive that we do) and I don’t quite have it in me to destroy the futures of some random students (or perhaps faculty?) in India. The online journal where it appeared is published in Teheran, and does not appear on Beall’s list of predatory journals. What to do?

The responses ran 10-0 in favor of doing something. Here is a typical one: 

you should alert the journal where it appeared. I’m not familiar with Webology, but if it’s a reputable, peer-reviewed journal, authors would be required to attest that the material is original (or sufficiently sourced with references). My assumption is that the editors will want to know that this article failed their pre-publication review process and should be retracted. Bad behavior and laziness should not be rewarded. You have every right to point out that careful, deliberative, accepted and established processes were not followed in this case. Sometimes we need to be the “check” in checks and balances.

The final point I would make is that it’s better to respond to this situation quickly (sooner rather than later). Otherwise, a long & growing chain of references could develop pointing to the plagiarized article. It can be difficult to retract and correct mistakes, but the problem only gets worse (more entrenched) over time. good luck,

I followed up with the journal, but have received no response so far. I have not followed up with Integral University of Lucknow, whose website simply claims theirs is the “best university in India”. The head of the Department of Computer Application, where both plagiarist authors work, is none other than Mohammad Faisal, second “author.” I could complain directly to him, or perhaps to the first author, Farooq Ahmad, who seems to be in charge of quality assurance for “Teaching Learning & Education” at the department.

So far I have not done so (some people on the mailing list suggested complaining to their superiors, which I haven’t done either). But again, I am not interested in destroying their careers. If they continue to serve at the Best University in India so be it. Someone on the mailing list wrote:

Flattery is one thing, plagiarism is something else again.  From what you describe, this is far beyond flattery.  The author had to knowingly do this and from what you say provided nothing of his own. I consider plagiarism worse than theft. Stolen physical objects can be recovered or replaced. Stealing ideas can’t be.

But I strongly disagree: ideas reproduce at zero cost; a stolen idea is an idea that somehow gets broader circulation. What is stolen really is the credit for the idea (more on this later). Also, consider this reply on the mailing list:

Back in the 1980s and 1990s I found myself asking large groups of computer science undergraduates to write simple essays. We often found that Indian students (in particular) were “plagiarising”. When we confronted the students we discovered that this was truly a symptom of a culture clash. The norm at their home schools was to do research, often as a collaborative group, draw their own conclusions but include supporting evidence directly from published work elsewhere without attribution. They had never been told that this was unacceptable, and in their previous experience it was completely normal. So we very simply gave them a written statement of what was considered appropriate in a US setting, including attribution of included materials and acknowledgement of the work of collaborators. There were no complaints and the problem stopped.

The fact that the journal claims to check for plagiarism but didn’t catch this seems to be a serious problem.  I can’t tell from your description whether they have a statement for submitters to sign that asserts they haven’t plagiarized — some journals do and some just take it for granted. Based on my experience, the “take for granted” approach isn’t wise in an international context.

What makes the whole affair interesting is that it sheds some light on the seamy side of academic publishing. Since citations are the coin of the realm, the tracking of citations is a life and death matter both for clueless university administrators, the Vogons who increasingly rule academia, and for the professors caught in the rat race. Back in the day it was “publish or perish,” and one side effect was the proliferation of second- and third-rate journals that helped desperate authors improve their publication count. My own university is guilty of this practice, publishing such a journal at least since the 1970s, so who is to cast the first stone on Webology? 

By the standard eigenvector method (nowadays better known as Google’s PageRank), scientometry is perfectly capable of detecting the lower- and higher-quality journals, and this is what impact factor should all be about. But as soon as IF became important (e.g. in evaluating “research output” by universities and grant agencies), it became subject to Goodhart’s Law, and Retraction Watch already discussed very significant efforts to game it. Back in the day, this method was monopolized by Thomson Reuters whose annual Journal Citation Report was largely in line with the researchers’ own assessment of journal quality, except for weird effects that were stemming mostly from misclassification (what constitutes a comparable range), from weak control for the size of the field in question, and from selection bias (it was a lot of work to start indexing a new journal). 

This last effect, which hurt journals published in foreign/mixed languages particularly, is by now greatly ameliorated by publication shifting online, which enables better language detection, better citation parsing and, at least in principle, better duplicate detection. Today there is an active effort by Clarivate to combat the gaming of IF (a similar arms race is taking place between Google and the SEOs) and I, for one, expect impact rankings to become better over time.

This is something that we should all welcome, especially as 2nd- and 3rd-tier publications play an important role in the research ecosystem (besides letting Vogons pretend they have genuine academic credentials). Peer review is far from perfect, and sometimes genuine jewels get lost in the rush to the highest IF journals. But this is rare (and there is no suggestion that my plagiarized paper was such) and the system is largely self-correcting by the well-known mechanism of rejected authors systematically working toward getting their paper published in the next-highest tier (and walking down the hierarchy until it gets accepted). Séminaire Delange-Pisot-Poitou is no Annals of Mathematics, but it publishes solid work that needs to see the light of the day. Just as in industry, there is a genuine structural need for the “Mittelstand,” and even for “SOHO,” journals.

This brings me to the real issue: How did I know about the plagiarism? The paper that got plagiarized is a genuine 2nd rater – it got published in the proceedings of a workshop sponsored by what was at the time the premier professional society of my field, the Association for Computational Linguistics. ACL proceedings are nowadays overshadowed by NeurIPS, ICLR, and ICML proceedings, but at the time they had better exposure, and our paper garnered 35 citations according to Semantic Scholar, 49 according to Google Scholar. A truly first-rate paper is much less likely to have been stolen – the risk would be too high. The problem is not so much stealing Le Déjeuner des Canotiers as trying to resell it — no fence in their right mind would touch it. I don’t read Webology, nor do I think I have ever come across a paper that was (originally?) published there. So how did I find out? As I explained on the mailing list:

I’m again working on a project I started 15 years ago, and this required looking at an old paper of mine “Eliminating ditransitives” […] – let’s call this paper “A”. One of the reasons why I haven’t continued with the project was that it had no pickup whatsoever, of the 20 citations (Google Scholar) it gathered in the meantime 19 were by my students, who of course knew it perfectly well that the subject is dear to my heart. Now that Google Scholar was showing one independent citation I was curious to check what the citing paper “B“, had to say. Finally, finally, someone is interested in my brilliant contribution to the hard problem that “A” attacked!

Sadly, no such luck. It was just that the paper in Webology was a verbatim copy of “B”, which cited “A” – the one I really care about –  and in the process professors Ahmad and Faisal lifted the citation together with the content. 

Perhaps the moral is that we should be grateful to the ever-growing citation databases. Making these public would be a lot better than the current state of the art, just as it would be better to have a lively IF competition rather than the monopolistic Clarivate, but that’s another story. Given the current “H-index or bust” mentality, I should be actually grateful to Webology for having lifted what was a self-citation (which my university, quite correctly, does not count towards H-index) to the more valuable “independent citation” category. For better or worse, I am flattered. 

Andras Kornai is a professor at the Budapest Institute of Technology and Senior Scientific Advisor at the SZTAKI Computer and Automation Research Institute.

Like Retraction Watch? You can make a tax-deductible contribution to support our work, subscribe to our free daily digest or paid weekly updatefollow us on Twitter, like us on Facebook, or add us to your RSS reader. If you find a retraction that’s not in The Retraction Watch Database, you can let us know here. For comments or feedback, email us at [email protected].

13 thoughts on “‘The sincerest form of flattery’: How a math professor discovered his work had been plagiarized”

  1. This journal is already discussed on Researchgate in various discussion and questions. It boils down to the following: it is highly unlikely that requests (to retract) towards this Webology journal will be dealt with in any way.
    The journal is either hijacked or no longer managed by a trustworthy party. The website (http://webology.org/about.php ) itself mentions that the current/new publisher (indicating a new management…) is “Info Sci Publisher” (nothing can be found about them on the net, so I doubt whether they really exist).
    According to https://academic-accelerator.com/Publication-Fee/Webology they nowadays charge a fee and looking at the number of papers published (which ‘exploded’ in 2021) it strongly looks like that they started ‘cashing’ (and abusing) their Scopus indexing and accept basically everything (the indexing ended with the discontinuation in 2021 https://www.scopus.com/sourceid/4400151723).
    So, I think cloned (or hijacked) is not the right word since I believe there is no longer a legit and genuine version of this journal. A better description is that it turned into a predatory one.
    PS. According to the latest Scopus source list and more specific the discontinued titles list the last papers that are indexed in Scopus are till: Webology, Volume 18, special issue 1137-1157 and consequently the plagiarized paper is not indexed.

  2. It’s not a matter of complaining, but of informing people – in the first instance, the journal, as you did. That seems like an obligation (in the absence of the type of information which Rob Keller has just supplied, which tends to make that particular issue moot). Apart from that, in general it seems like informing Retraction Watch of such cases for entry into the database is useful, just as a tracking measure, even when it is not something calling for explicit mention on the blog.

    “Bad money drives out good.” These people are filling slots that might otherwise be filled by more competent people. I have one colleague, now retired, who wound up leaving India entirely for similar reasons (he tried actively to raise a question of plagiarism at his own institution). I’d have no compunctions about informing their superiors of their activities. And no expectation that it would be useful, but one never knows.

    That said, the problem is less with individual bad actors, as with the current publishing model. Which, again, takes us back to Rob Keller’s discussion. Others are trying to address that issue, in various ways, but it is not a casual activity.

  3. You might consider creating a post at PubPeer. At least that way the offending paper is flagged, at least those who have the PubPeer browser extension.

  4. 1. This very respected author of the original and plagiarized paper is going too soft on plagiarism and thieves. This behavior actually damages science. What does it even mean that I am not interested in them losing their jobs?! Are *you* responsible for others’ faults? *They* (and not you) had to think about that before plagiarizing.

    2. They will NOT lose their jobs, because their university won’t care about them plagiarizing.

    3. To someone who had justified plagiarism with “cultural norms”: Wake up! It has nothing to do with culture norms and culture clash. Many people knowingly fabricate fake academy degrees; if you see them, you will say “it is their cultural norm to fabricate false academic degrees and immigrate to the US”. So naive.

    It is not as if they innocently don’t know if plagiarizing is bad. They know very well that it is bad, but when caught red-handed, they pretend that it is a cultural norm.

    4. Iranian universities and journals usually won’t answer any emails, especially in English. You will have a much better chance of getting an answer if you ask an Iranian colleague of yours to call them and talk to them in Persian.

  5. Nicolas Chevassus-au-Louis wrote about activities like in his book, Fraud in the lab: The high stakes of scientific research. Wild stuff, some of the examples of fraud and plagiarism that he noted. It’s a good, disturbing read.

  6. (facepalm) The plagiarists didn’t even proofread what they copied. If you look at the literature list, you find gems such as:
    * In˜igo
    * Andra´s Kornai
    * Computational Linguis- tics
    * Parsing Unre- stricted Text.
    etc. They just copied the hyphenation from the original, not bothering to read it or recognizing that the diacritica get lost in the copying. There can’t have been any peer review on this. Definately put a notice on Pub-Peer and use the similarity-texter (https://people.f4.htw-berlin.de/~weberwu/simtexter/app.html) to include some nice screenshots of the wholesale copy & paste job!

  7. I, too, don’t buy the ‘cultural’ defense. Its one thing to let slide a student paper – altho’ worldwide plagiarism standards should be followed with appropriate disciplinary measures – another to let ‘authors’ to misrepresent authorship. As others have noted, prizes, awards, scholarships, grants, and other recognition or reward attend authorship. Perhaps there are cultures in which fraud or theft is accepted without reproach, but worldwide publication shouldn’t tolerate it.

  8. How can we be sure that indeed those two professors from India submitted the plagiarized article to Webology? It could be anybody. It needs to be checked whether they include it in their CV or on their website etc.

      1. Indeed, well spotted Marco. Looking at https://sites.google.com/iul.ac.in/drmohammadfaisal/publications it is hard to believe that, besides the two papers in Webology, life dealt him a bad hand every time when choosing a journal to publish in:
        “Journal of Huazhong University of Science and Technology”
        “TEST Engineering & Management magazine”
        Both listed in the Retraction Watch Hijacked Journal Checker https://retractionwatch.com/the-retraction-watch-hijacked-journal-checker/
        And another questionable choice was “Journal of critical reviews” that turned into a fake/predatory journal and is discontinued from Scopus (in 2021).

Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.