Need help understanding URL aliases to serve text files

Time: 2014-01-08 23:43:13

Tags: php apache .htaccess drupal

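This is my first time setting up an alias. I have found several examples of how to use one, but for some reason I'm still not 100% sure how the syntax is supposed to work. I've spent most of today looking for a concrete example, with no luck. Haha, I think I really need some hand-holding on this one. Here are some of the examples I found, for the reader: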

httpd.apache.org/docs/2.2/urlmapping.html

net.tutsplus.com/tutorials/apache-aliasing-and-redirection/

serverfault.com/questions/362040/accessing-directories-outside-of-documentroot

Here is my situation. First, the web root of the remote server I'm using is:

'/data1/home/spaceweather/htdoc'

The data file archive I want to link to is located at:

'/data1/public'

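I have about 20,000 text files that will eventually be searchable by the public through the website I'm developing. The directory structure containing all the files looks like this: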

data1/
     /public/
            /cases/
                  /grid108/
                          <here there are folders, one for every day of the year>
                          <which contains the text files>
                  /grid112/
                          <here there are folders, one for every day of the year>
                          <which contains the text files>
                  /grid180/
                          <here there are folders, one for every day of the year>
                          <which contains the text files>

An example path all the way down to a single text file looks like this:

'/data1/public/cases/grid108/144/txt/channel_dataout_2013_144_0000.log'

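The path above is definitely outside my web root folder, which is why I think an alias is the best option. Online guides say that to create an alias I need to modify my .htaccess file, so I went and found it (shown below). You'll notice the word Drupal sprinkled throughout the code below; that's because the site I'm building is powered by Drupal.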

My .htaccess file:

#
# Apache/PHP/Drupal settings:
#

# Protect files and directories from prying eyes.
<FilesMatch "\.(engine|inc|info|install|make|module|profile|test|po|sh|.*sql|theme|tpl(\.php)?|xtmpl)(~|\.sw[op]|\.bak|\.orig|\.save)?$|^(\..*|Entries.*|Repository|Root|Tag|Template)$|^#.*#$|\.php(~|\.sw[op]|\.bak|\.orig\.save)$">
  Order allow,deny
</FilesMatch>

# Don't show directory listings for URLs which map to a directory.
Options -Indexes

# Follow symbolic links in this directory.
Options +FollowSymLinks

# Make Drupal handle any 404 errors.
ErrorDocument 404 /index.php

# Set the default handler.
DirectoryIndex index.php index.html index.htm

# Override PHP settings that cannot be changed at runtime. See
# sites/default/default.settings.php and drupal_environment_initialize() in
# includes/bootstrap.inc for settings that can be changed at runtime.

# PHP 5, Apache 1 and 2.
<IfModule mod_php5.c>
  php_flag magic_quotes_gpc                 off
  php_flag magic_quotes_sybase              off
  php_flag register_globals                 off
  php_flag session.auto_start               off
  php_value mbstring.http_input             pass
  php_value mbstring.http_output            pass
  php_flag mbstring.encoding_translation    off
</IfModule>

# Requires mod_expires to be enabled.
<IfModule mod_expires.c>
  # Enable expirations.
  ExpiresActive On

  # Cache all files for 2 weeks after access (A).
  ExpiresDefault A1209600

  <FilesMatch \.php$>
    # Do not allow PHP scripts to be cached unless they explicitly send cache
    # headers themselves. Otherwise all scripts would have to overwrite the
    # headers set by mod_expires if they want another caching behavior. This may
    # fail if an error occurs early in the bootstrap process, and it may cause
    # problems if a non-Drupal PHP file is installed in a subdirectory.
    ExpiresActive Off
  </FilesMatch>
</IfModule>

# Various rewrite rules.
<IfModule mod_rewrite.c>
  RewriteEngine on

  # Set "protossl" to "s" if we were accessed via https://.  This is used later
  # if you enable "www." stripping or enforcement, in order to ensure that
  # you don't bounce between http and https.
  RewriteRule ^ - [E=protossl]
  RewriteCond %{HTTPS} on
  RewriteRule ^ - [E=protossl:s]

  # Make sure Authorization HTTP header is available to PHP
  # even when running as CGI or FastCGI.
  RewriteRule ^ - [E=HTTP_AUTHORIZATION:%{HTTP:Authorization}]

  # Block access to "hidden" directories whose names begin with a period. This
  # includes directories used by version control systems such as Subversion or
  # Git to store control files. Files whose names begin with a period, as well
  # as the control files used by CVS, are protected by the FilesMatch directive
  # above.
  #
  # NOTE: This only works when mod_rewrite is loaded. Without mod_rewrite, it is
  # not possible to block access to entire directories from .htaccess, because
  # <DirectoryMatch> is not allowed here.
  #
  # If you do not have mod_rewrite installed, you should remove these
  # directories from your webroot or otherwise protect them from being
  # downloaded.
  RewriteRule "(^|/)\." - [F]

  # If your site can be accessed both with and without the 'www.' prefix, you
  # can use one of the following settings to redirect users to your preferred
  # URL, either WITH or WITHOUT the 'www.' prefix. Choose ONLY one option:
  #
  # To redirect all users to access the site WITH the 'www.' prefix,
  # (http://example.com/... will be redirected to http://www.example.com/...)
  # uncomment the following:
  # RewriteCond %{HTTP_HOST} .
  # RewriteCond %{HTTP_HOST} !^www\. [NC]
  # RewriteRule ^ http%{ENV:protossl}://www.%{HTTP_HOST}%{REQUEST_URI} [L,R=301]
  #
  # To redirect all users to access the site WITHOUT the 'www.' prefix,
  # (http://www.example.com/... will be redirected to http://example.com/...)
  # uncomment the following:
  # RewriteCond %{HTTP_HOST} ^www\.(.+)$ [NC]
  # RewriteRule ^ http%{ENV:protossl}://%1%{REQUEST_URI} [L,R=301]

  # Modify the RewriteBase if you are using Drupal in a subdirectory or in a
  # VirtualDocumentRoot and the rewrite rules are not working properly.
  # For example if your site is at http://example.com/drupal uncomment and
  # modify the following line:
  # RewriteBase /drupal
  #
  # If your site is running in a VirtualDocumentRoot at http://example.com/,
  # uncomment the following line:
  # RewriteBase /

  # Pass all requests not referring directly to files in the filesystem to
  # index.php. Clean URLs are handled in drupal_environment_initialize().
  RewriteCond %{REQUEST_FILENAME} !-f
  RewriteCond %{REQUEST_FILENAME} !-d
  RewriteCond %{REQUEST_URI} !=/favicon.ico
  RewriteRule ^ index.php [L]

  # Rules to correctly serve gzip compressed CSS and JS files.
  # Requires both mod_rewrite and mod_headers to be enabled.
  <IfModule mod_headers.c>
    # Serve gzip compressed CSS files if they exist and the client accepts gzip.
    RewriteCond %{HTTP:Accept-encoding} gzip
    RewriteCond %{REQUEST_FILENAME}\.gz -s
    RewriteRule ^(.*)\.css $1\.css\.gz [QSA]

    # Serve gzip compressed JS files if they exist and the client accepts gzip.
    RewriteCond %{HTTP:Accept-encoding} gzip
    RewriteCond %{REQUEST_FILENAME}\.gz -s
    RewriteRule ^(.*)\.js $1\.js\.gz [QSA]

    # Serve correct content types, and prevent mod_deflate double gzip.
    RewriteRule \.css\.gz$ - [T=text/css,E=no-gzip:1]
    RewriteRule \.js\.gz$ - [T=text/javascript,E=no-gzip:1]

    <FilesMatch "(\.js\.gz|\.css\.gz)$">
      # Serve correct encoding type.
      Header set Content-Encoding gzip
      # Force proxies to cache gzipped & non-gzipped css/js files separately.
      Header append Vary Accept-Encoding
    </FilesMatch>
  </IfModule>
</IfModule>

Now here's the part where I'm lost. This is what I assume will work. If I go to the end of the file above and add Alias /cases/data/ "/data1/public/cases/", it would redirect the URL:

apollo.tbc.iit.edu/~spaceweather/cases/data/grid108/144/txt/channel_dataout_2013_144_0000.log

to:

apollo.tbc.iit.edu/~spaceweather/data1/public/cases/grid108/144/txt/channel_dataout_2013_144_0000.log

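This is why the approach doesn't make sense to me. How does this code tell the web server to serve files from outside the web root? How do I get a URL like this: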

apollo.tbc.iit.edu/~spaceweather/data/cases/grid108/144/txt/channel_dataout_2013_144_0000.log

to take the user to the file located at:

/data1/public/cases/grid108/144/txt/channel_dataout_2013_144_0000.log

When I put this file in my htdocs folder and point my browser at apollo.tbc.iit.edu/~spaceweather/channel_dataout_2013_144_0000.log, it displays the file in the browser as expected.

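The site http://apollo.tbc.iit.edu/~spaceweather is live and can be viewed by anyone who points a browser at it. So the link in the paragraph above also works and shows the file to the world.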

Thanks for your help, and please be gentle.

- ZealotSveta

1 Answer:

Answer 0 (score: 0)

I think you may be overcomplicating things.

As far as I can tell, all you need is a symbolic link from your docroot to the data folder.

Create one with the following commands:

mkdir -p /data1/home/spaceweather/htdoc/cases  # just in case it does not exist
ln -s /data1/public/cases /data1/home/spaceweather/htdoc/cases/data

This creates a symbolic link named "data" that points to your /data1/public/cases directory.
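
To see what the link does before touching the real server, the same steps can be tried in a scratch directory. This is only a sketch: the scratch paths are hypothetical stand-ins for /data1/public/cases and /data1/home/spaceweather/htdoc, and the sample file content is made up.

```shell
#!/bin/sh
# Dry run in a scratch directory; nothing here touches the real server paths.
set -eu

scratch=$(mktemp -d)
# Hypothetical stand-ins for /data1/public/cases and /data1/home/spaceweather/htdoc
archive="$scratch/public/cases"
docroot="$scratch/htdoc"

mkdir -p "$archive/grid108/144/txt" "$docroot/cases"
echo "sample log line" > "$archive/grid108/144/txt/channel_dataout_2013_144_0000.log"

# Same shape as the real command: ln -s <target> <link name>
ln -s "$archive" "$docroot/cases/data"

# The link itself records the target path...
link_target=$(readlink "$docroot/cases/data")
# ...and files in the archive are reachable through the docroot path.
served=$(cat "$docroot/cases/data/grid108/144/txt/channel_dataout_2013_144_0000.log")

echo "data -> $link_target"
echo "$served"

# Clean up the scratch directory.
rm -rf "$scratch"
```

With the real link at htdoc/cases/data, the file would presumably be requested as apollo.tbc.iit.edu/~spaceweather/cases/data/grid108/144/txt/channel_dataout_2013_144_0000.log, assuming ~spaceweather maps to that htdoc directory.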

If you run ls -l in that directory, you should see something like:

data -> /data1/public/cases

Try to cd into the data directory to verify that it works.
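
If you'd rather script that check, something along these lines may help. It is a rough sketch: check_link is a hypothetical helper name, and it only tests access for the user running it, not for the account Apache runs as, so a passing result does not guarantee the web server can read the files.

```shell
#!/bin/sh
# check_link is a hypothetical helper name, not a standard command.
check_link() {
    link=$1
    if [ ! -L "$link" ]; then
        echo "not a symlink: $link"
        return 1
    fi
    # -d follows the link, so this also catches dangling targets.
    if [ ! -d "$link" ]; then
        echo "dangling or non-directory target: $link"
        return 1
    fi
    # Read and search permission for the current user only.
    if [ ! -r "$link" ] || [ ! -x "$link" ]; then
        echo "target not readable/searchable by this user: $link"
        return 1
    fi
    echo "ok: $link -> $(readlink "$link")"
}

# On the server this would be:
#   check_link /data1/home/spaceweather/htdoc/cases/data
```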


This isn't necessary for you, since you already have it in your .htaccess, but I'm leaving it here for others anyway:

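If it doesn't work through the web browser at first, you may need to add FollowSymLinks to your .htaccess: Options +FollowSymLinks. This tells Apache that it is allowed to follow the symlink you created.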
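
A quick way to tell these cases apart is to look at the HTTP status code the server returns. The sketch below is only a rule of thumb: explain_status is a hypothetical helper, and the hints are guesses to check against the Apache error log rather than a diagnosis. The URL in the usage comment assumes the symlink was created at htdoc/cases/data.

```shell
#!/bin/sh
# explain_status is a hypothetical helper; the hints are rules of thumb only.
explain_status() {
    case $1 in
        200) echo "file served" ;;
        403) echo "forbidden: check FollowSymLinks and directory permissions" ;;
        404) echo "not found: check the link name and the path under it" ;;
        *)   echo "unexpected status $1: check the Apache error log" ;;
    esac
}

# Usage against the live site (requires curl and network access):
#   url='http://apollo.tbc.iit.edu/~spaceweather/cases/data/grid108/144/txt/channel_dataout_2013_144_0000.log'
#   explain_status "$(curl -s -o /dev/null -w '%{http_code}' "$url")"
```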

Hope this helps. If I've misunderstood your situation or requirements, please say so and I'll edit accordingly.