Automating data extraction from invoices using Rossum API and UiPath

In this tutorial, you will learn how to fully automate data extraction from invoices using UiPath Studio and the Rossum API. Rossum’s cognitive data capture technology stands out because it does not require an explicit setup for every invoice template; it finds the information out of the box. In this example, we’ll start with a directory of PDF files and/or images and produce a CSV spreadsheet containing the automatically extracted field values for each invoice.

The tutorial in this post has been superseded as we upgraded our API offering. Please refer to a newer UiPath tutorial built around our modern Document API.

19 comments

  • Indra Rahmat

    Does Rossum support the Thai language?

  • Rossum

    Hello Indra – thank you for your message. We do not currently support the Thai language, but we are working on adding more languages. We currently support English, Czech, German (DE) and Slovak.

  • sandeep kumar

    Hello, I have invoices in different formats, but all of them contain an invoice number and an invoice date. I want to extract these fields from all the invoice PDFs and store them in a CSV file. How can I do that? Please help.

    • Rossum

      Hello Sandeep – thank you for your message. Someone from our team will be in touch with you shortly to discuss your use case.

  • Paul Pu

    Hello, we’d like to read each line item in the invoice, not only general information such as ID, date, etc. Is that possible?

    • Rossum

      Hello Jie, thank you for your message. Someone from our team will be in touch with you shortly to discuss your use case.

  • sarmad ali

    Can I have a demo, please?

    • Rossum

      Hello Sarmad, thank you for your message. Someone from our team will be in touch with you shortly to schedule a demo.

  • Simon LUI

    Hello, this article is amazing. Our eform includes a lot of checkboxes and data tables. Would you mind giving me more information about how your solution handles them? Thanks a lot.

    • Rossum

      Hello Simon – thank you for your message. Someone from our team will be in touch with you shortly to provide you with more information.

  • Simon LUI

    Hello, I made a typo: I meant to say that our PDF form includes a lot of checkboxes and data tables.

  • San

    I tried to use the UiPath code block, but I am running into an issue with the UiPath HTTP Request activity (Error – The request was aborted: Could not create SSL/TLS secure channel).

    So I tried to execute the “Example in command line” code, copied from the developers home page. I get the error below in the command prompt as well.

    400 Bad Request
    Bad Request
    The browser (or proxy) sent a request that this server could not understand.

    I am currently building a POC with UiPath activities, so please help me solve these issues so that I can use this AI solution as part of my demo.

    Thanks
    San

    • Petr Baudis

      Hello San, can you make sure a proxy, e.g. within a corporate network, isn’t involved? The easiest way to check is to run the same command from within that network and from a different network (e.g. via cellphone tethering). If the command works on the other network, you will need to ask your network admins to allow access to the Elis servers.

  • Bruno Dalmazo

    Hi Petr, I’m receiving the same error, and I have tried on three different networks.

    400 Bad Request
    Bad Request
    The browser (or proxy) sent a request that this server could not understand.

    I’m calling the API from JavaScript inside my ERP (NetSuite) and also from Postman; both return this response.

    var api_Key = '***';
    var url = 'https://all.rir.rossum.ai/document';
    var postdata = 'file url download';

    var headers = new Array();

    headers['Authorization'] = 'secret_key ' + api_Key;
    headers['Accept'] = 'application/json';
    headers['Content-Type'] = 'multipart/form-data';

    var response = nlapiRequestURL(url, postdata, headers, 'POST');

    • Rossum

      Hi Bruno, thanks for getting in touch. Your error appears to in fact have a different cause than San’s – I would wager that it’s because you are not passing the file contents specified as the `file` form parameter in the postdata. But it seems that someone from our support is also already in touch with you – hope you get that issue sorted out quickly!
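To illustrate the point about the `file` form parameter, here is a minimal sketch of a multipart/form-data body in which the document bytes travel in a part named `file`. It assumes Node.js (for `Buffer`) rather than NetSuite SuiteScript. The helper names `buildMultipartBody` and `buildHeaders`, the boundary string, and the sample file name are hypothetical. The `secret_key` authorization scheme is taken from the snippet above and has not been checked against the current API. The sketch only builds the request pieces and sends nothing.

```javascript
// Build a multipart/form-data body by hand so that the raw file bytes are
// carried in a form part named "file" (hypothetical helper, not a Rossum API).
function buildMultipartBody(fileName, fileBytes, boundary) {
  const head = Buffer.from(
    '--' + boundary + '\r\n' +
    'Content-Disposition: form-data; name="file"; filename="' + fileName + '"\r\n' +
    'Content-Type: application/octet-stream\r\n\r\n',
    'utf8'
  );
  // The closing delimiter is the boundary followed by two extra dashes.
  const tail = Buffer.from('\r\n--' + boundary + '--\r\n', 'utf8');
  return Buffer.concat([head, fileBytes, tail]);
}

// The Content-Type header has to name the same boundary used in the body;
// a bare "multipart/form-data" gives the server nothing to split on.
function buildHeaders(apiKey, boundary) {
  return {
    'Authorization': 'secret_key ' + apiKey, // scheme copied from the snippet above
    'Accept': 'application/json',
    'Content-Type': 'multipart/form-data; boundary=' + boundary,
  };
}

// Usage with placeholder values (the bytes stand in for a real PDF).
const boundary = '----ExampleBoundary1234';
const body = buildMultipartBody('invoice.pdf', Buffer.from('%PDF-1.4 placeholder'), boundary);
const headers = buildHeaders('***', boundary);
console.log(headers['Content-Type']);
console.log(body.length + ' bytes in request body');
```

Compared with the snippet above, the two differences are that the body contains the file contents themselves rather than a plain string, and that the `Content-Type` header carries a `boundary` parameter matching the body. Either omission is a plausible reason for a server to reject the request as malformed.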

  • Kuenzang Thinley

    I would like to have more information on this. Could you guide me further?
    Thank you

  • rajani

    Nice Blog!! Thanks for sharing..

  • Aks

    Hello,

    Can you please let me know whether it will work with unstructured PDF, JPEG and PNG files as well? Could you also give me a quick demo of this? Many thanks in advance.

    Regards,
    Aks

    • Rossum

      Hi Aks!
      Yes, it is possible to set up a demo. Please fill out your details and someone from our team will be in touch. https://rossum.ai/contact/
