java - Selenium 2: Detect content type of link destinations -

- January 15, 2014

I am using the Selenium 2 Java API to interact with web pages. My question is: how can I detect the content type of link destinations?

Here is the background: before clicking on a link, I want to make sure that the response is an HTML file. If it is not, I have to handle it in another way. Suppose, for example, that there is a download link for a PDF file: instead of opening it in the browser, the application should read the contents of that URL directly.

The goal is an application that automatically determines whether the current location is HTML, PDF, XML or something else, so that it can use the appropriate parser to extract useful information from the document.

Updates

Added a bounty: it goes to the best solution that lets me get the content type of a given URL.

As Yochan suggested, the way to get the content type without downloading the content is an HTTP HEAD request, and Selenium WebDriver does not seem to provide such functionality.
There is a Java library which can do this: Apache HttpComponents, especially its HttpClient.
(The following code is untested.)
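    import org.apache.http.Header;
    import org.apache.http.HttpResponse;
    import org.apache.http.client.HttpClient;
    import org.apache.http.client.methods.HttpHead;
    import org.apache.http.impl.client.DefaultHttpClient;

    HttpClient httpclient = new DefaultHttpClient();
    HttpHead httphead = new HttpHead("http://foo/bar");               // HEAD: headers only, no body
    HttpResponse response = httpclient.execute(httphead);
    Header contenttypeheader = response.getFirstHeader("Content-Type"); // null if the header is absent
    System.out.println(contenttypeheader);

The project publishes its documentation, which contains a good example of this.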
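
For comparison, the same HEAD check can probably be done with nothing but the JDK's own java.net.HttpURLConnection. The following is only a minimal sketch under that assumption, not part of the original answer: the class name ContentTypeProbe and the methods mimeTypeOf, isHtml and fetchContentType are hypothetical names chosen for illustration, and the 5 second timeouts are arbitrary.

```java
import java.io.IOException;
import java.net.HttpURLConnection;
import java.net.URL;
import java.util.Locale;

// Hypothetical helper (not from the original post): decides whether a link
// target is HTML by looking at the Content-Type header of a HEAD response.
final class ContentTypeProbe {

    // Strips parameters such as "; charset=UTF-8" and lower-cases the media type.
    // Returns an empty string when the header is missing.
    static String mimeTypeOf(String contentTypeHeader) {
        if (contentTypeHeader == null) {
            return "";
        }
        int semicolon = contentTypeHeader.indexOf(';');
        String type = semicolon >= 0
                ? contentTypeHeader.substring(0, semicolon)
                : contentTypeHeader;
        return type.trim().toLowerCase(Locale.ROOT);
    }

    // Treats text/html and application/xhtml+xml as "HTML"; everything else
    // (PDF, XML, missing header, ...) would need separate handling.
    static boolean isHtml(String contentTypeHeader) {
        String mime = mimeTypeOf(contentTypeHeader);
        return mime.equals("text/html") || mime.equals("application/xhtml+xml");
    }

    // Sends a HEAD request and returns the raw Content-Type header, or null if
    // the server did not send one. Assumes an http:// or https:// URL; other
    // schemes would make the cast below fail.
    static String fetchContentType(String url) throws IOException {
        HttpURLConnection connection = (HttpURLConnection) new URL(url).openConnection();
        try {
            connection.setRequestMethod("HEAD");
            connection.setConnectTimeout(5000); // arbitrary timeout, in milliseconds
            connection.setReadTimeout(5000);
            return connection.getContentType();
        } finally {
            connection.disconnect();
        }
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java ContentTypeProbe <url>");
            return;
        }
        String header = fetchContentType(args[0]);
        System.out.println(header + " -> " + (isHtml(header) ? "HTML" : "not HTML"));
    }
}
```

With Selenium, the URL would presumably come from the link element's href attribute (for example link.getAttribute("href")), and the link would only be clicked when isHtml returns true. Two caveats: some servers do not answer HEAD requests correctly, and this request does not carry the browser's cookies, so pages behind a login may report a different content type than the browser would see.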
