8 Replies Latest reply on Feb 4, 2011 10:06 AM by Sean Slattery

    problem with a particular url for a pdf

    Sean Slattery

      Hi,

       

      The following link is launched from an email confirmation of a online event/ticket system. When the traffic is direct, IE8 receives the pdf file. When sent via MWG702, IE just generates a generic "cannot display the webpage." If I try using Firefox 3.6.13 via the MWG, there is an authentication prompt which never succeeds in loading the file.

       

      I suspect that there is a web script running that pulls the file from another site. I was thinking that enabling rules tracing might help determine the site dependencies.

       

      http://www.eventbrite.com/print-ticket/24067927/31056473/24067927-31056473-ticke ts.pdf?c=OTgzNzI4Mw%3D%3D%0A

       

      TIA

       

      Regards,

      Sean

        • 1. Re: problem with a particular url for a pdf
          Sean Slattery

          I had initially looked through the connection tracing logs and thought that there might be an issue with one of my media filters, but disabling these did not change anything.

           

          It now looks like the problem relates to a script requiring authentication. All of our proxy traffic requires authentication. When I disable the authorize and authenticate ruleset, the file downloads successfully. I can't pinpoint what script is running or what additional site is being referenced.

          • 2. Re: problem with a particular url for a pdf
            Sean Slattery

            Actually I was wrong about the authentication. Even if I disable every policy/rule, I cannot download the file.

            • 3. Re: problem with a particular url for a pdf
              dstraube

              Hello Sean,

               

              if you disable every rule and still can't download the file it is either a network problem or a problem of the proxy implementation. The network traces might help here, especially the server communication (MWG <-> Webserver) would be important.

               

              I've tested the download with 7.0.2.2.0 (9451) and I can download the file. If you have an older build you might want to upgrade. 7.0.2.2 includes several fixes for not quite RFC conform header formats and bug fixes for chunked encoding transfer. As this server is using chunked encoding and the file is fairly small this might be such a problem.

               

              Regards,

               

              Dirk

              • 4. Re: problem with a particular url for a pdf
                Sean Slattery

                Yes, I am using 7.0.2.2.0 (9451). I have two completely separate domains/networks each a VMware based Proxy HA implementation. I deployed all the MWG's so they should be identical which matches the behavior on both networks. I will investigate further.

                • 5. Re: problem with a particular url for a pdf

                  Sean,

                   

                  I tried the URL you posted and recieved the same results you did - page cannot be displayed within IE, and an auth prompt in Firefox.  Upon some further testing, I bypassed authentication for my PC on the WG7.  Still recieved the page cannot be displayed within IE but the auth prompt in Firefox went away.  I then put *eventbrite.com in a URL.Host whitelist for the Media Type Filters - same results.  Finally, I put an  *eventbrite.com entry into the URL.Host whitelist for into the AV scan ruleset.  Once I did that, the file downloaded as expected.

                   

                  Just thought I'd offer that up so you know that it can work.

                   

                  Steve

                   

                   

                  Message was edited by: importminded on 2/3/11 7:20:26 AM CST
                  • 6. Re: problem with a particular url for a pdf
                    dstraube

                    Hello,

                     

                    I've been able to reprduce this on a different machine and it seems server related. The same file downloaded from a different server works fine. The Gateway Antimalware ruleset has an effect, when it is disabled the download worked. I can't see an error in the antimalware engine directly and currently I assume a problem with the request passing from the proxy towards the AV engine.

                     

                    I've all the network traces I need and forwarded this to development. This needs to be debugged I guess.

                     

                    Regards,

                     

                    Dirk

                    • 7. Re: problem with a particular url for a pdf
                      dstraube

                      Already received feeback from development. It's a bug on the server. There are several bugs actually.

                       

                      In case you can download the file, look at the end of the document with an editor (hex-editor) and you will notice a HTML file. This shouldn't be. The server sends the content and then a HTML page, which is wrong.

                       

                      In addition the content is supposed to be gzip encoded when your browser accepts this. The server only encodes the HTML page at the end, not the file that is transfered. This is what prevents the page from getting delivered to the client, because encoding obviously fails.

                       

                      This is a bug with the script on the site that creates the document.

                       

                      If you rely on this site you could create a rule that removes the"Accept-Encoding" header from the browser request for this domain, so that the webserver will not try to send responses with broken encoding. The PDF is still not ok, but it can be downloaded and most PDF viewers probably don't mind the error and will open the file anyway.

                      1 of 1 people found this helpful