Using MATLAB to batch search citation counts of IS papers


One of my courses requires us to research the methods and theories used during a specific period of time, which means reading every paper published in the top journals in that period. After a quick look, I found that I would need to read about 100 papers, which is frankly impossible in just two weeks.

So I wanted to know the current citation count of each of these papers, which makes it easier to decide which ones are worth reading first. I wrote some MATLAB code to do this and hope it can help you too. Specific code here

Basic info

  1. Version: 1.0
  2. Copyright: HU WEI
  3. Date: 2017-09-18
  4. Method: Use MATLAB
  5. Environment:
  • OS: Mac OS X El Capitan
  • MATLAB: R2014b
  • Default Web Browser: Chrome Version 60.0.3112.113 (Official Build) (64-bit)
  • Google Scholar Language: English
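
The language setting probably matters because a script like this has to find the citation label in the result page text. Here is a minimal sketch, assuming the English label "Cited by N" is what gets matched. The HTML snippet is made up for illustration and is not taken from the original code.

```matlab
% Made-up snippet standing in for one Google Scholar result (assumption).
html = '<a href="/scholar?cites=123">Cited by 57</a> <a href="#">Related articles</a>';

% Capture the digits that follow the English label "Cited by".
tok = regexp(html, 'Cited by\s+(\d+)', 'tokens', 'once');

if isempty(tok)
    nCite = 0;                    % no label found: treat as 0 (an assumption)
else
    nCite = str2double(tok{1});   % convert the captured digits to a number
end
fprintf('Citations: %d\n', nCite);
```

With a different interface language the label text changes, so a pattern like this one would find nothing.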


Credits

  1. google2link.m is based on Fahad Al Mahmood’s google.m; please refer to "google" by Fahad Al Mahmood on File Exchange.
  2. For pdfRead.m, full credit goes to Derek Wood; please refer to "Read text from a PDF document" by Derek Wood on File Exchange.


Before you start

  1. Create a new folder named PDF files and move the PDF files whose citation counts you want to look up into it.
  2. Make sure you have a working internet connection.
  3. google_citation.m can only recognize papers from three journals:

  • Management Information Systems Quarterly
  • INFORMS Journal on Computing
  • Information Systems Research


Usage

  1. Run Google_Citation.m and you will get:
  • citation.mat, a data file that stores the title and citation count of each paper;
  • a Citation folder, which stores the renamed PDF files in the format Citation #-Old name.pdf.
  2. To load citation.mat and check the data, see import_demo.m for reference.
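
The renaming step can be sketched as follows. This is only an illustration of the naming pattern, not the actual code: the file names, the counts, and the variable names (oldNames, counts, outDir) are hypothetical.

```matlab
% Hypothetical inputs: PDF names and the citation counts found for them.
oldNames = {'paperA.pdf', 'paperB.pdf', 'paperC.pdf'};
counts   = [12, 340, 5];
outDir   = 'Citation';   % assumed name of the output folder

% Sort so that the most cited papers come first.
[sortedCounts, order] = sort(counts, 'descend');
sortedNames = oldNames(order);

% Build the new names in the "Citation #-Old name.pdf" pattern.
newNames = cell(size(sortedNames));
for k = 1:numel(sortedNames)
    newNames{k} = sprintf('%d-%s', sortedCounts(k), sortedNames{k});
    % To really copy the file, uncomment the next two lines:
    % if ~exist(outDir, 'dir'), mkdir(outDir); end
    % copyfile(fullfile('PDF files', sortedNames{k}), fullfile(outDir, newNames{k}));
end
disp(newNames');
```

Putting the count at the front of the file name means the papers can be ranked at a glance in the folder view.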


Known issues

  1. Sometimes Google Scholar asks you to verify that you are not a robot, and the script stops with an error like the following:

    The server returned the message: “Service Unavailable” for URL,’
    (with HTTP response code 503).

The HTTP response code 503 means that:

The server is currently unable to handle the request due to a temporary overload or scheduled maintenance, which will likely be alleviated after some delay.

You may need to try again later.
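
"Trying again later" can also be done in code by waiting and retrying a few times. Below is a minimal sketch with a simulated request so that it runs without a network. The variable names and delays are my own assumptions, and waiting may still not be enough if Google Scholar insists on the robot check.

```matlab
maxTries     = 4;     % give up after this many attempts (assumed value)
waitSecs     = 0.1;   % first delay in seconds; use a much longer one in practice
failuresLeft = 2;     % demo only: pretend the first two requests return 503

html = '';
for attempt = 1:maxTries
    try
        % Simulated request; in real use replace this block with
        % html = urlread(url);
        if failuresLeft > 0
            failuresLeft = failuresLeft - 1;
            error('demo:unavailable', 'Service Unavailable (simulated 503)');
        end
        html = '<html>ok</html>';
        break;                         % success: stop retrying
    catch err
        fprintf('Attempt %d failed: %s\n', attempt, err.message);
        if attempt < maxTries
            pause(waitSecs);           % wait before the next attempt
            waitSecs = waitSecs * 2;   % double the delay each time
        end
    end
end

if isempty(html)
    warning('All attempts failed; try again later.');
end
```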


To do

  • Use text mining techniques to recognize the title, author, and other fields automatically
  • Convert to Python
  • Build an API
