Advanced HTML Guide

(This Advanced HTML Guide was last modified February 19, 2000. Please send any comments or suggestions to Prof. Dunsmore.)

TRANSPARENT BACKGROUNDS FOR IMAGES

Many images look better if their background color matches browser window

Image appears to float in window

Could change background color of image

BUT ... some browsers set their own background color

Best Solution -- Create .gif image with one color as "transparent"

Tells browser to use background color for those color pixels in image

ppmtogif has "transparent" option

New Image --

tifftopnm < globe.tif | ppmtogif -trans #FF0000 > newglobe.gif

Existing Image --

giftopnm globe.gif | ppmtogif -trans #FF0000 > newglobe.gif

Image with Red Background

Image with Transparent Red Background

JPEG does not support transparent pixels because of the way it stores and uses pixel colors

DYNAMIC DOCUMENTS

SERVER PUSH --

Server sends down page

Browser displays page but leaves connection open

Whenever server wants, sends another page and browser displays it

CLIENT PULL --

Server sends down page including directive that says "reload page in 5 seconds" or "load another URL in 10 seconds"

Client does as told, either reloading current page or getting another one

Dynamic Documents via Client Pull

Reload Page

Document reloads itself once a second

Document reloads itself every 15 seconds

Make sure META tag used inside HEAD of HTML document

Directive doesn't say "go get this page every 15 seconds from now until infinity"

Says "go get this page in 15 seconds"

If you set up "infinite reload" situation, only way to interrupt is by pressing Back button or by going to different URL

To Load Another URL

Loads doc2.html in 8 seconds

Make sure URL is fully-qualified

Don't use relative URL

Can dynamically load several documents consecutively this way

Last document should have no "Refresh" in it

HELPING SEARCH ENGINES FIND YOUR WEBSITE

Many search engines rely heavily on the TITLE

Use keywords in the TITLE

<TITLE> International Programs (study abroad, students, scholars, campus seminars, faculty resource guide) </TITLE>

Commas are ignored for keywords

Many search engines use the first 200-300 words for extracting a description and identifying keywords

Some search engines (for example, InfoSeek, AltaVista) will let you specify these yourself using META tag attribute NAME

Remember: META tag must be in HEAD of HTML document

Some search engines (for example, Excite) pay attention to comments as well as regular text

Use a comment to specify keywords without destroying the readability of the page

MULTIPLE TEXT COLUMNS

Netscape allows for newspaper-style columns

Put MULTICOL container around text to put in multiple columns

COLS attribute specifies number of columns

Each column is same width (varies depending on browser screen size)

GUTTER attribute specifies spacing between columns (in pixels) (optional -- defaults to 10)

This crucial Civil War campaign (April-July 1863) captured the last Confederate fortress on the Mississippi River, divided the Confederacy in two, and gave the Union complete control of the river. This victory, along with that at Gettysburg, Pennsylvania, (July 1-3, 1863) tilted the war in favor of the North. <MULTICOL COLS=3 GUTTER=7> ... </MULTICOL>

FRAMES

Page or screen can be divided into sections called frames

Each frame contains its own unique URL, allowing users to scroll through multiple sites simultaneously

Makes possible banners, ledges, tables of contents

Queries executed in one frame can generate results in another frame

Reduces need to jump back and forth between screens

Frame can function as ledge, constant area of the screen, keeping important information out front while the user scrolls through pages

HTML Guide Demo

FRAME DOCUMENT

Frame Document has basic structure much like normal HTML document, except BODY container replaced by FRAMESET container

<HTML>

</HTML>

FRAME SYNTAX

<FRAMESET>

Has 2 attributes ROWS and COLS

Frame document has no BODY

FRAMESET tag has matching /FRAMESET tag

Within FRAMESET can only have other nested FRAMESET tags, FRAME tags, or NOFRAMES tag

ROWS="list"

ROWS attribute takes comma-separated list of values

Best to use percentage values

If total is not 100%, all percentages will be scaled to match total of 100%

COLS="list"

COLS attribute takes comma-separated list of values

Best to use percentage values

If total is not 100%, all percentages will be scaled to match total of 100%

FRAMESET tag can be nested inside other FRAMESET tags

<FRAME>

Defines single frame in frameset

Some selected attributes --

SRC="url"

SRC attribute takes URL of document to be displayed in this frame

NAME="window_name"

NAME attribute assigns name to frame so it can be targeted by links in other documents ... usually from other frames in same document

NAME attribute is optional

Named frames can have window contents targeted with TARGET attribute

<BASE TARGET="window_name">

Used when all (or most) links in frame targeted to same window

TARGET attribute establishes default window_name

<A HREF="cows.html">Cattle</A> will open cows.html in default window

<A HREF="cows.html" TARGET="ledge">Cattle</A> will open cows.html in window named "ledge"

<A HREF="cows.html" TARGET="_self">Cattle</A> will open cows.html in same frame the link is in -- useful for occasionally overriding a BASE target

<A HREF="cows.html" TARGET="_top">Cattle</A> will open cows.html in non-framed full-screen window -- useful for breaking out of frame nesting

SCROLLING="yes|no|auto"

SCROLLING attribute describes if frame should have scrollbar or not

yes = scrollbars always visible for frame

no = scrollbars never visible

auto = browser decides whether scrollbars needed

SCROLLING attribute is optional

Default value is auto

NORESIZE

NORESIZE attribute is flag indicating frame is not resizable by user

By default all frames are resizable

<NOFRAMES>

NOFRAMES tag is for alternative content viewable by non-Frame-capable clients

By default, frames have BORDER=5

But, this can be changed with BORDER=n in outermost FRAMESET

BORDER=0 leads to frames with no borders at all

Need for scrollbars overrides BORDER=0

CGI-BIN

CGI stands for Common Gateway Interface

Allows HTTP server to run programs that generate dynamic documents, rather than being restricted to delivering only static documents

To enable CGI on a Web server, Web Master designates directory as CGI-bin directory

"bin" comes from old convention that directory containing executable programs has "bin" (for BINary code) in its name -- /bin, /usr/bin, /usr/local/www/bin

Often programs named with suffix .cgi (like search.cgi)

When client requests document from this directory, rather than sending program to client, Web server runs program and sends output from program to client

Should use CGI programming whenever you need to get input from user or whenever information you want to send is dynamic

For example, Rounder Records Catalog Search facility uses CGI-bin program to get search criteria, perform search, and return resulting information to client

Try searching for Artist / Musician "Little Jimmy"

Any programming language that is supported on server machine can be used for CGI programs

Popular languages are Java, PERL (Practical Extraction and Report Language), TCL (Tool Command Language), C, and C++

IMAGE MAPS

Imagemap program is usually part of cgi-bin directory

(1) Create an image

(2) Create an imagemap file

File maps regions in the image to URLs

Imagemap is collection of polygons, rectangles, and circles

Try to keep individual items in image spaced out far enough so user clearly knows what he or she is clicking on

Every line in imagemap file:

method URL coord1 coord2 ... coordn

method is one of the following:

default -- default URL -- coordinates: none (if click is not in a polygon, rectangle, or circle)

circle -- coordinates: center edgepoint

poly -- coordinates: up to 100 vertices in the order in which you want them to appear

rect - coordinates: upper-left lower-right

URL must be full (not relative) URL

Each method is evaluated in order placed in imagemap file

For overlapping areas (circle inside of a rectangle), place whichever one you want evaluated first before the other in the imagemap file

default http://www.cs.purdue.edu/homes/bxd/special.html

poly http://www.purdue.edu/OIP/iss/isshome.html 395,184 658,188 592,325 538,337 466,320 357,253

rect http://www.purdue.edu/UPS/Student/picc/ 237,59 343,184

Imagemap file can be named anything but usually uses suffix .map (like world.map)

(3) Reference your imagemap in an HTML file

Office of International Programs Demo

imagemap file world.map

A good way to identify pixel locations in an image used in a clickable image map...

Put the image in an HTML file between <A HREF> and </A> with the IMG attribute ISMAP

Bring up that page and notice that most Web browsers tell you the x,y location of any point....

CLIENT SIDE IMAGE MAPS

Limitations of server side image map implementation:

(1) Server transaction required to determine where link is directed

(2) No way for browser to show where portion of image map leads before user clicks on it

Client Side Image Map Demo

USEMAP attribute of IMG element indicates client-side image map

Argument to USEMAP specifies which map to use with image

Format similar to HREF attribute on anchors

If argument to USEMAP starts with a "#", assumed to be in the same document as the IMG tag

Regions of image are described using MAP element

</MAP>

Name specifies name of map so can be referenced by IMG element

Shape gives shape of area

Currently supported shapes include rectangles (RECT), polygons (POLY), circles (CIRCLE), and default (DEFAULT)

RECT - coordinates: upper-left lower-right

POLY -- coordinates: up to 100 vertices in any order

CIRCLE -- coordinates: center radius (NOTE! Differs from server-side image map which uses center and edgepoint)

DEFAULT usually unnecessary -- any region of image not defined by AREA tag assumed to do nothing

If DEFAULT is used to map to some other URL, put it last -- otherwise, nothing after it works (NOTE! Differs from server-side image map which uses DEFAULT in more sane manner)

COORDS tag gives coordinates of shape

HREF tag specifies where click in that area should lead

Unlike Server Side, HREF can be relative URL

</MAP>

FORMS

HTML forms are used to get information from user before giving requested information

HTML Form Elements

<FORM> directive indicates beginning of HTML form

Should be followed by </FORM>

Document may contain one or more HTML forms

Sub elements of the <FORM> directive:

ACTION="/cgi-bin/program.cgi" -- URL of CGI program that will process output of form

METHOD="GET|POST" -- Method by which CGI Program will get output from form

GET method places form output on URL in query string -- http://.../cgi-bin/search.cgi?bob+smith

POST method places output in stdin input file of CGI program

<INPUT> directive defines an input field

TYPE=TEXT -- Text field that accepts character data (default)

TYPE=PASSWORD -- Text field that accepts character data, but does not display it

TYPE=CHECKBOX -- Field that is either "on" or "off"

TYPE=RADIO -- Only one of group of radio buttons can be "on", others are automatically turned off

TYPE=SUBMIT -- Sends the form to the URL defined in the ACTION option of the <FORM> directive

TYPE=IMAGE NAME=SUBMIT SRC="..." -- also sends the form information just like a Submit button

TYPE=RESET -- Browser resets form fields to default values

NAME="name" -- Variable name associated with value of input field

SIZE=size -- Length in characters of input field (TEXT and PASSWORD input fields only)

MAXLENGTH=max -- Maximum number of characters for input field (TEXT and PASSWORD input fields only)

CHECKED -- Field should be selected or checked by default (CHECKBOX and RADIO input fields only)

VALUE=default input text displayed originally (TEXT and PASSWORD input fields only)

VALUE=value returned when field is checked or selected (CHECKBOX and RADIO input fields only)

VALUE=text displayed on button (SUBMIT and RESET input fields only) -- Defaults are "Submit Query" and "Reset"

<TEXTAREA> directive defines large text input field

Any text between <TEXTAREA> and </TEXTAREA> directive will be default text displayed originally

<TEXTAREA> Options:

NAME="name" -- Variable name associated with text in input area

WRAP=PHYSICAL -- The display word-wraps. The default is WRAP=OFF

ROWS=rows -- Rows of text in input area

COLS=cols -- Columns of text in input area

<SELECT> and <OPTION> directives

Must be used together

<OPTION> directive defines selection item

Multiple <OPTION> directives usually specified

Must be between <SELECT> and </SELECT> directives

Options to <SELECT> directive:

NAME="name" -- Variable name associated with returned selection

SIZE=num -- Number of options displayed at any one time

MULTIPLE -- More than one selection can be made

Options to <OPTION> directive:

SELECTED -- This option selected by default

INPUT TYPE=TEXT