【问题标题】:Nginx proxy or root depending on user agentNginx 代理或根取决于用户代理
【发布时间】:2018-07-11 21:40:53
【问题描述】:

我有一个单页应用程序/渐进式 Web 应用程序,我使用 Nginx 作为静态文件提供它,并使用以下设置:

server {
        listen 80;
        listen [::]:80;

        root /var/www/mydomain.com/html;

        index index.html index.htm index.nginx-debian.html;

        server_name mydomain www.mydomain;

        location / {
                sendfile           on;
                tcp_nopush on;
                sendfile_max_chunk 1m;

                tcp_nodelay       on;
                keepalive_timeout 65;

                # First attempt to serve request as file, then
                # as directory, then fall back to displaying a 404.
                try_files $uri $uri/ /index.html;
        }

}

现在我想检查用户代理是否是,例如 Facebook 爬虫,在这种情况下我想使用本地 NodeJS 服务器的代理,我对 Nginx 不太了解,但是在谷歌搜索这个问题之后我结束了加上以下代码:

server {
    listen 80;
    listen [::]:80;

    server_name mydomain www.mydomain;

    location / {
        try_files $uri @prerender;
    }

    location @prerender {
        set $prerender 0;
        if ($http_user_agent ~* "baiduspider|twitterbot|facebookexternalhit|rogerbot|linkedinbot|embedly|quora link preview|showyoubot|outbrain|pinterest|slackbot|vkShare|W3C_Validator") {
            set $prerender 1;
        }
        if ($args ~ "_escaped_fragment_") {
            set $prerender 1;
        }
        if ($http_user_agent ~ "Prerender") {
            set $prerender 0;
        }
        if ($uri ~ "\.(js|css|xml|less|png|jpg|jpeg|gif|pdf|doc|txt|ico|rss|zip|mp3|rar|exe|wmv|doc|avi|ppt|mpg|mpeg|tif|wav|mov|psd|ai|xls|mp4|m4a|swf|dat|dmg|iso|flv|m4v|torrent|ttf|woff)") {
            set $prerender 0;
        }

        #resolve using Google's DNS server to force DNS resolution and prevent caching of IPs
        resolver 8.8.8.8;

        if ($prerender = 1) {

            #setting prerender as a variable forces DNS resolution since nginx caches IPs and doesnt play well with load balancing
            set $prerender "localhost:3030";


            rewrite .* /$scheme://$host$request_uri? break;
            proxy_pass http://$prerender;


        }
        if ($prerender = 0) {
            # First attempt to serve request as file, then
            # as directory, then fall back to displaying a 404.
            root /var/www/mydomain.com/html;
        }
    }
}

上面的代码不起作用,我不知道为什么会非常感谢任何帮助。

【问题讨论】:

    标签: node.js nginx prerender


    【解决方案1】:

    这是对我有用的最终代码,以防万一有人遇到同样的问题:

    server {
        listen 80;
        listen [::]:80;
    
        server_name mydomain.com www.mydomain.com;
    
        location / {
            set $prerender 0;
            if ($http_user_agent ~* "baiduspider|twitterbot|facebookexternalhit|rogerbot|linkedinbot|embedly|quora link preview|showyoubot|outbrain|pinterest|slackbot|vkShare|W3C_Validator") {
                set $prerender 1;
            }
            if ($args ~ "_escaped_fragment_") {
                set $prerender 1;
            }
            if ($http_user_agent ~ "Prerender") {
                set $prerender 0;
            }
            if ($uri ~ "\.(js|css|xml|less|png|jpg|jpeg|gif|pdf|doc|txt|ico|rss|zip|mp3|rar|exe|wmv|doc|avi|ppt|mpg|mpeg|tif|wav|mov|psd|ai|xls|mp4|m4a|swf|dat|dmg|iso|flv|m4v|torrent|ttf|woff)") {
                set $prerender 0;
            }
    
            #resolve using Google's DNS server to force DNS resolution and prevent caching of IPs
            resolver 8.8.8.8;
    
            if ($prerender = 1) {
    
                #setting prerender as a variable forces DNS resolution since nginx caches IPs and doesnt play well with load balancing
                set $prerender "localhost:3030";
                proxy_pass http://$prerender;
    
            }
    
            root /var/www/mydomain.com/html;
            try_files $uri $uri/ /index.html;
        }
    }
    

    【讨论】:

      【解决方案2】:

      你可以试试这个吗?

      server {
          listen 80;
          listen [::]:80;
      
          server_name mydomain www.mydomain;
      
          location / {
              set $prerender 0;
              if ($http_user_agent ~* "baiduspider|twitterbot|facebookexternalhit|rogerbot|linkedinbot|embedly|quora link preview|showyoubot|outbrain|pinterest|slackbot|vkShare|W3C_Validator") {
                  set $prerender 1;
              }
              if ($args ~ "_escaped_fragment_") {
                  set $prerender 1;
              }
              if ($http_user_agent ~ "Prerender") {
                  set $prerender 0;
              }
              if ($uri ~ "\.(js|css|xml|less|png|jpg|jpeg|gif|pdf|doc|txt|ico|rss|zip|mp3|rar|exe|wmv|doc|avi|ppt|mpg|mpeg|tif|wav|mov|psd|ai|xls|mp4|m4a|swf|dat|dmg|iso|flv|m4v|torrent|ttf|woff)") {
                  set $prerender 0;
              }
      
              #resolve using Google's DNS server to force DNS resolution and prevent caching of IPs
              resolver 8.8.8.8;
      
              if ($prerender = 1) {
      
                  #setting prerender as a variable forces DNS resolution since nginx caches IPs and doesnt play well with load balancing
                  set $prerender "localhost:3030";
      
      
                  rewrite .* /$scheme://$host$request_uri? break;
                  proxy_pass http://$prerender;
      
      
              }
      
              try_files $uri $uri/ /index.html;
          }
      }
      

      【讨论】:

      • 感谢您的帮助,当我使用您的代码而不更改任何内容时,它根本不起作用,并且默认 Nginx 页面始终呈现,但是当我在 try_files $uri $uri/ /index.html; 之前添加 root /var/www/mydomain.com/html; 时它起作用了当我从网络浏览器访问它时,它会正确地呈现 /var/www/mydomain.com/html 中的静态文件,但是当我从 Facebook 或 Google 爬虫访问网站时,我收到类似 Error Cannot GET mydomain.com/path 的错误,知道这是为什么吗?
      • 当爬虫访问该URL时,它会响应404响应码!
      • 这个配置真的很糟糕,真的很糟糕。位置上下文中的多个 if 条件,无意义的重写规则,没有根指令。这就像一个如何不配置 Nginx 的完美示例。请在部署之前阅读nginx.com/resources/wiki/start/topics/depth/ifisevilnginx.com/resources/wiki/start/topics/tutorials/config_pitfalls
      • @miknik 谢谢你,你认为如何解决这个问题?
      猜你喜欢
      • 2012-05-24
      • 1970-01-01
      • 1970-01-01
      • 2021-09-06
      • 2020-01-05
      • 1970-01-01
      • 1970-01-01
      • 1970-01-01
      • 1970-01-01
      相关资源
      最近更新 更多